I have a very big text file (few GB) that has the following format:

Asked: June 13, 2026

I have a very big text file (few GB) that has the following format:

The file is already sorted and duplicate lines have been removed. There are still repeated pairs in reverse order, such as ‘2 1’ and ‘4 3’, that I want to remove. Does anybody have a solution that works in a very resource-limited environment, in Bash, AWK, Perl or a similar language? I cannot load the whole file into memory and loop over the values.

1 Answer

Editorial Team · Answer 1 · 2026-06-13T10:58:24+00:00

Possible solution:

1. Scan the file.
2. For any pair where the second value is less than the first, swap the two numbers.
3. Sort the pairs again by first, then second number.
4. Remove duplicates.
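
The steps above could be sketched roughly as follows. This is only an illustration, assuming each line holds exactly two whitespace-separated integers; the function name `dedup_pairs` and the file names in the usage comment are made up for the example. `awk` streams the file line by line and `sort` does an external merge sort using temporary files, so neither needs to hold the whole file in memory. Note that each surviving pair is written with the smaller number first.

```shell
#!/bin/sh
# Hypothetical helper: reads "a b" pairs on stdin, writes each unordered
# pair once on stdout, smaller number first.
# awk:  one streaming pass that swaps the fields when the second is smaller.
# sort: orders by first then second number (numeric) and drops duplicates.
dedup_pairs() {
    awk '($2 + 0) < ($1 + 0) { t = $1; $1 = $2; $2 = t } { print $1, $2 }' |
        LC_ALL=C sort -k1,1n -k2,2n -u
}

# Usage (file names are placeholders):
#   dedup_pairs < pairs.txt > pairs.dedup.txt
```

With GNU `sort`, the `-S` (buffer size) and `-T` (temporary directory) options may help on a machine with little RAM or a small `/tmp`; whether they are needed depends on the environment.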

I’m still thinking about a more efficient solution in terms of disk sweeps, but this is a basic, naive approach.
