I have a problem coming up with an algorithm. Will you, guys, help me

Question


Asked: May 22, 2026


I'm having trouble coming up with an algorithm. Could you guys help me out here?

I have a file that is huge and therefore cannot be loaded into memory all at once. It contains duplicate data (generic data, possibly strings). I need to remove the duplicates.


1 Answer

Editorial Team · Answer 1 · 2026-05-22T14:56:56+00:00


One easy but slow solution is to read the first gigabyte of the file into a HashSet. Then read the rest of the file sequentially and remove every string that is already in the set. After that, read the second gigabyte of what remains into memory (a fresh HashSet), remove its duplicates from the rest of the file, and repeat until the whole file has been processed.
It's quite easy to program, and if you only need to do it once, it could be enough.
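A minimal Python sketch of the multi-pass idea above, under some assumptions that are not in the original answer: records are newline-terminated text lines, the chunk is measured in distinct lines (`chunk_size`) rather than in gigabytes, and the lines that are not yet deduplicated are carried from pass to pass in a temporary file. The function name `dedupe_large_file` and its parameters are hypothetical.

```python
import os
import tempfile


def dedupe_large_file(src_path, dst_path, chunk_size=1_000_000):
    """Write the unique lines of src_path to dst_path, keeping first occurrences.

    chunk_size is an assumed bound on how many distinct lines fit in memory.
    The source file itself is left untouched.
    """
    current = src_path
    with open(dst_path, "w", encoding="utf-8") as out:
        while True:
            seen = set()  # the in-memory "HashSet" for this pass
            leftover = 0
            fd, rest_path = tempfile.mkstemp(suffix=".rest")
            with open(current, encoding="utf-8") as inp, \
                    os.fdopen(fd, "w", encoding="utf-8") as rest:
                for line in inp:
                    key = line.rstrip("\n")
                    if key in seen:
                        continue  # duplicate of a line already written out
                    if len(seen) < chunk_size:
                        # Still filling this pass's chunk: keep the line.
                        seen.add(key)
                        out.write(key + "\n")
                    else:
                        # Chunk is full: defer the line to a later pass.
                        rest.write(key + "\n")
                        leftover += 1
            if current != src_path:
                os.remove(current)  # drop the previous pass's temp file
            if leftover == 0:
                os.remove(rest_path)
                break
            current = rest_path
```

Each pass settles up to `chunk_size` distinct lines and rewrites everything else, so the number of passes grows with the number of distinct lines divided by `chunk_size`. That is why this approach is simple but slow on very large inputs.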
