i have a file like this: term1 term2 term3 term4 term2 term1 term5 term3

Question

0

Asked: May 14, 20262026-05-14T05:28:21+00:00 2026-05-14T05:28:21+00:00

i have a file like this: term1 term2 term3 term4 term2 term1 term5 term3

0

i have a file like this:

term1 term2
term3 term4
term2 term1
term5 term3
..... .....

what i need to do is to remove duplicates in any order they appear, such as:

term1 term2

and

term2 term1

is a duplicate to me.
It is a really long file, so I’m not sure what can be faster.
Does anyone has an idea on how to do this? awk perhaps?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-14T05:28:21+00:00

Editorial Team

2026-05-14T05:28:21+00:00Added an answer on May 14, 2026 at 5:28 am

Ordering each word in the line and sorting is easy with perl.

./scriptbelow.pl < datafile.txt | uniq

#!/usr/bin/perl

foreach(sort map { reorder($_) } <>) {
    print;
}

sub reorder {
    return join(' ', sort { $a cmp $b } split(/\s+/, $_)) . "\n";
}

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

i have a file like this: term1 term2 term3 term4 term2 term1 term5 term3

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply