I am new to bash programming (grep/uniq/sort/etc…) and I am having trouble trying to

Asked: June 17, 2026 (2026-06-17T16:58:10+00:00)

I am new to bash programming (grep/uniq/sort/etc…) and I am having trouble removing duplicates from a file with the following format:

--
name: joe
tag: 123
--
name: mike
tag: 000
--
name: dave
tag: 123
--
name: loopy
tag: 123
--

Basically what I want is to remove the duplicates in the file which have the same tag number, like this:

--
name: joe
tag: 123
--
name: mike
tag: 000
--

1 Answer

Editorial Team · Answer 1 · 2026-06-17T16:58:10+00:00

This task is a good fit for awk. If you have gawk or mawk available (both accept a multi-character record separator), you can set the record separator to the -- lines so that each name/tag block becomes one record:

awk -v RS='--\n' -v ORS='--\n' '!h[$4]++' infile

Output:

--
name: joe
tag: 123
--
name: mike
tag: 000
--

This works by remembering which tags have been seen in the array h, keyed on $4, the fourth whitespace-separated field of each record (the tag value). The post-increment h[$4]++ returns the previous count, so the bang (!) in front of it makes the condition true only when that count is zero, and the default rule ({ print $0 }) is invoked only the first time a tag is seen.
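To make the mechanics explicit, the same logic can be written out long-hand. This is only a sketch: the function names sample and dedupe_by_tag, the array name seen, and the way the sample data is generated are illustrative choices, and an awk that accepts a multi-character RS (gawk or mawk) is still assumed.

```shell
# Sample input in the question's format.
sample() {
  printf '%s\n' '--' 'name: joe' 'tag: 123' \
                '--' 'name: mike' 'tag: 000' \
                '--' 'name: dave' 'tag: 123' \
                '--' 'name: loopy' 'tag: 123' '--'
}

# Same behaviour as '!h[$4]++', spelled out step by step.
# Assumes an awk with multi-character RS support (gawk or mawk).
dedupe_by_tag() {
  awk -v RS='--\n' -v ORS='--\n' '
    {
      tag = $4                # fourth whitespace-separated field of the record
      if (!(tag in seen)) {   # true only the first time this tag appears
        seen[tag] = 1
        print                 # emit the record, followed by ORS
      }
    }
  '
}

sample | dedupe_by_tag
```

Note that the text before the first -- forms an empty first record. It is printed once, which is where the leading -- line in the output comes from.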

A slightly shorter version, passing the variables as trailing assignments instead of with -v:

awk '!h[$4]++' RS='--\n' ORS='--\n' infile

Edit – handle records where name fields have spaces

If a name contains spaces, the number of whitespace-separated fields varies and $4 no longer holds the tag. You can handle this by splitting fields on newlines and on the colon instead:

awk '!h[$4]++' RS='--\n' ORS='--\n' FS='\n| *: *' infile
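As a quick check of this field splitting, the sketch below runs the same command on records whose names contain spaces. The data and the function names spaced_sample and dedupe_spaced are made up for illustration, and - is used to read standard input in place of infile.

```shell
# Made-up records whose name values contain spaces.
spaced_sample() {
  printf '%s\n' '--' 'name: joe smith' 'tag: 123' \
                '--' 'name: mary ann lee' 'tag: 123' \
                '--' 'name: mike' 'tag: 000' '--'
}

# Splitting on newlines, or on a colon with optional surrounding spaces,
# keeps each value in one field:
#   $1 = "name", $2 = name value, $3 = "tag", $4 = tag value
# Assumes an awk with multi-character RS support (gawk or mawk).
dedupe_spaced() {
  awk '!h[$4]++' RS='--\n' ORS='--\n' FS='\n| *: *' -
}

spaced_sample | dedupe_spaced
```

Here "joe smith" and "mary ann lee" share tag 123, so only the first of the two records is kept, along with the record for tag 000.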
