I have an input file as following. I need to break them into multiple

Question

Asked: June 12, 2026 (2026-06-12T05:18:19+00:00)

I have an input file as follows. I need to split it into multiple files based on columns 2, 3 and 5. The file has more columns, but I have used the cut command to keep only the required ones.

12,Accounts,India,free,Internal
13,Finance,China,used,Internal
16,Finance,China,free,Internal
12,HR,India,free,External
19,HR,China,used,Internal
33,Finance,Japan,free,Internal
39,Accounts,US,used,External
14,Accounts,Japan,used,External
11,Finance,India,used,External
11,HR,US,used,External
10,HR,India,used,External

Output files:

Accounts_India_Internal --
12,Accounts,India,free,Internal

Finance_China_Internal --
13,Finance,China,used,Internal
16,Finance,China,free,Internal

HR_India_External --
12,HR,India,free,External
10,HR,India,used,External

HR_China_Internal --
19,HR,China,used,Internal

and so on.

Please let me know how to achieve this.

My current plan is to sort the file on these columns (2, 3, 5) and then loop over each record, creating files as I go: if the target file does not exist, create it and add the record; otherwise open the existing file and append the record.

Is it possible to do this using shell scripting (bash)?

1 Answer

Editorial Team · Answer 1 · 2026-06-12T05:18:20+00:00

Is it possible to do this using shell scripting (bash)?

If you simply want to split the file based on fields 2, 3 and 5, you can do that quickly with awk:

awk -F, '{print >> $2"_"$3"_"$5}' infile.txt

That appends each line to a file whose name is made up of fields 2, 3 and 5.
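
One portability caveat on the one-liner: some awk implementations treat an unparenthesized concatenation after >> as ambiguous or reject it, so it can be safer to build the file name in a variable first. A minimal sketch of that variant, assuming a few of the sample rows from the question; the scratch directory and the `out` variable are my own additions for illustration:

```shell
# Work in a scratch directory so earlier runs cannot leave stale output behind.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# A few of the sample rows from the question.
cat > infile.txt <<'EOF'
12,Accounts,India,free,Internal
13,Finance,China,used,Internal
16,Finance,China,free,Internal
12,HR,India,free,External
19,HR,China,used,Internal
10,HR,India,used,External
EOF

# Build the output name from fields 2, 3 and 5, then append the whole line to it.
awk -F, '{ out = $2 "_" $3 "_" $5; print >> out }' infile.txt

cat Finance_China_Internal
```

Writing `print >> ($2 "_" $3 "_" $5)` with explicit parentheses should behave the same way; the variable just keeps the one-liner easier to read.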

Example:

[me@home]$ awk -F, '{print >> $2"_"$3"_"$5}' infile.txt 
[me@home]$ cat Accounts_India_Internal
12,Accounts,India,free,Internal
[me@home]$ cat Finance_China_Internal
13,Finance,China,used,Internal
16,Finance,China,free,Internal

If you want the output sorted, you can first run the file through sort:

sort -k2,3 -k5,5 -t, infile.txt | awk -F, '{print >> $2"_"$3"_"$5}'

That sorts the lines on fields 2, 3, and 5 before passing them on to the awk command.
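
One thing to watch with the sorted variant: when the key fields compare equal, sort falls back to comparing the whole line, so rows inside a group can change order (in the question's data, the HR_India_External row starting with 10 would move ahead of the one starting with 12). A small sketch of keeping the original order within each group, assuming a sort that supports the -s (stable) option, as GNU and BSD sort do; the scratch directory and `out` variable are illustrative only:

```shell
# Scratch directory so the demo starts with no leftover output files.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Three rows from the question; the two HR_India_External rows tie on the sort keys.
printf '%s\n' \
  '12,HR,India,free,External' \
  '13,Finance,China,used,Internal' \
  '10,HR,India,used,External' > infile.txt

# -s disables the last-resort whole-line comparison, so tied rows keep input order.
# LC_ALL=C makes the key comparison byte-wise and locale-independent.
LC_ALL=C sort -s -t, -k2,3 -k5,5 infile.txt |
  awk -F, '{ out = $2 "_" $3 "_" $5; print >> out }'

cat HR_India_External
```

Without -s, the same pipeline would still group the rows correctly; only the order of rows inside each output file may differ.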

Note that we’re appending to the files, so if you repeat the command without deleting the output files, you’ll end up with duplicate data in them. To address this, as well as your additional requirement mentioned in the chat (using the first line as a header for all new files), see this solution.
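
The solution referred to above is not reproduced here. As a separate, hedged sketch of one way to handle both points: awk's > redirection truncates a file the first time it is opened in a run and then keeps writing to it, so repeated runs do not pile up duplicates. The header line below (`id,dept,country,status,type`) is hypothetical, since the sample data in the question has no header, and `split_with_header`, `header`, `out` and `seen` are names made up for this example:

```shell
# Scratch directory for the demo.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Sample rows from the question, preceded by a HYPOTHETICAL header line.
cat > infile.txt <<'EOF'
id,dept,country,status,type
12,Accounts,India,free,Internal
13,Finance,China,used,Internal
16,Finance,China,free,Internal
12,HR,India,free,External
10,HR,India,used,External
EOF

# Hypothetical helper: split a CSV on fields 2, 3 and 5, repeating the header.
split_with_header() {
  awk -F, '
    NR == 1 { header = $0; next }    # remember the first line, do not route it
    {
      out = $2 "_" $3 "_" $5
      if (!(out in seen)) {          # first row for this group in this run
        print header > out           # ">" truncates the file on first open
        seen[out] = 1
      }
      print > out                    # later writes go to the same open file
    }
  ' "$1"
}

split_with_header infile.txt
split_with_header infile.txt         # running it again does not duplicate rows

cat HR_India_External
```

The trade-off is that each run overwrites the output files from scratch, so this only fits if the files are always regenerated from the full input rather than built up incrementally.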
