I have ~2 million rows or so of data, each row with an artificial

Question

0

Asked: May 19, 20262026-05-19T00:25:30+00:00 2026-05-19T00:25:30+00:00

I have ~2 million rows or so of data, each row with an artificial

0

I have ~2 million rows or so of data, each row with an artificial PK, and two Id fields (so: PK, ID1, ID2). I have a unique constraint (and index) on ID1+ID2.

I get two sorts of updates, both with a distinct ID1 per update.

100-1000 rows of all-new data (ID1 is new)
100-1000 rows of largely, but not necessarily completely overlapping data (ID1 already exists, maybe new ID1+ID2 pairs)

What’s the most efficient way to maintain this ‘set’? Here are the options as I see them:

Delete all the rows with ID1, insert all the new rows (yikes)
Query all the existing rows from the set of new data ID1+ID2, only insert the new rows
Insert all the new rows, ignore inserts that trigger unique constraint violations

Any thoughts?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-19T00:25:31+00:00

Editorial Team

2026-05-19T00:25:31+00:00Added an answer on May 19, 2026 at 12:25 am

Not all of your listed solutions are functionally equivalent, so without more knowledge about what you want or need to accomplish, it’s hard to say which is most appropriate.

You may lose data that you want or need to keep.
Based on the table schema that you mentioned, this should be reasonable.
This will only work if you perform each INSERT separately.

I’d suggest [2] based on the available info.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have ~2 million rows or so of data, each row with an artificial

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply