If I want to remove lines where certain fields are duplicated then I use

Question

0

Asked: May 19, 20262026-05-19T22:36:17+00:00 2026-05-19T22:36:17+00:00

If I want to remove lines where certain fields are duplicated then I use

0

If I want to remove lines where certain fields are duplicated then I use sort -u -k n,n.
But this keeps one occurrence. If I want to remove all occurrences of the duplicate is there any quick bash or awk way to do this?

Eg I have:

1   apple   30
2   banana   21
3   apple   9
4   mango   2

I want:

2 banana   21
4 mango   2

I will presort and then use a hash in perl but for v. large files this is going to be slow.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-19T22:36:18+00:00

Editorial Team

2026-05-19T22:36:18+00:00Added an answer on May 19, 2026 at 10:36 pm

Try sort -k <your fields> | awk '{print $3, $1, $2}' | uniq -f2 -u | awk '{print $2, $3, $1}' to remove all lines that are duplicated (without keeping any copies). If you don’t need the last field, change that first awk command to just cut -f 1-5 -d ' ', change the -f2 in uniq to -f1, and remove the second awk command.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

If I want to remove lines where certain fields are duplicated then I use

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply