I have a legacy DB with: firstname, lastname, address1, address2, address3, address4, zipcode The

Question

0

Asked: May 13, 20262026-05-13T01:59:51+00:00 2026-05-13T01:59:51+00:00

I have a legacy DB with: firstname, lastname, address1, address2, address3, address4, zipcode The

0

I have a legacy DB with:
firstname, lastname, address1, address2, address3, address4, zipcode
The data is scattered between the different columns with no consistency eg the actual zipcode could be in any column and there are plenty of typos.

Is there a way I could use something like SOUNDEX / DIFFERENCE in a SP to loop through everything and return an ordered list of likely duplicates?
[it doesn’t need to be fast]

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-13T01:59:51+00:00

Editorial Team

2026-05-13T01:59:51+00:00Added an answer on May 13, 2026 at 1:59 am

If you are using SQl server 2005 or above, you can use fuzzy matching in SSIS to do this task. I found that I got significantly better results in doing this than in looking for soundex matches or writng my own sql scode to look for near matches.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I have a legacy DB with: firstname, lastname, address1, address2, address3, address4, zipcode The

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply