I wrote a code to perform some simple csv formatting but I know it’s

Editorial Team

Asked: June 3, 2026

I wrote some code to do some simple CSV formatting, but I know it’s not as good as it could be.

Here’s the input

1,a
1,b
1,c
2,d
2,e
3,a
3,d
3,e
3,f

Here’s the output I want

['1','a','b','c']
['2','d','e']
['3','a','d','e','f']

This is the code I wrote

import csv
input = csv.reader(open('book1.csv'))
output = open('output.csv', 'w')
job=[0,0]
for row in input:
    if row[0] == job[1]:
        job.append(row[1])
    else:
        print(job)
        #output.write(",".join(job))
        job[1] = row[0]
        job = [job[0], job[1]]
        job.append(row[1])

This is the output

[0, 0]
[0, '1', 'a', 'b', 'c']
[0, '2', 'd', 'e']

The questions I have are as follows

How can I finish the else branch so that each line gets written? How can I avoid having 0 as the zeroth element of the list? I would also like the code to output the last “job” list. Lastly, does anyone have any suggestions for improving this code?

I ask because I would like to get much better at writing code, instead of just hacking it together. Any responses would be greatly appreciated!
Thanks in advance

1 Answer

Editorial Team · Answer 1 · 2026-06-03T18:36:09+00:00

What you’re trying to do is group the second column by the first column. Python has a tool for that, itertools.groupby:

import itertools
import operator

groups = itertools.groupby(input, key=operator.itemgetter(0))

is an iterator yielding (key, group) tuples, where the key is the first item of each row, and each group is an iterator over the rows in that group.
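To make the shape of those tuples concrete, here is a small sketch that uses the sample rows from the question as an in-memory list (the names `rows` and `grouped` are only for illustration). Note that groupby only groups consecutive rows with equal keys, so it assumes the input is already ordered by the first column, as it is in the example.

```python
import itertools
import operator

# Sample rows from the question, in the form csv.reader would yield them.
rows = [['1', 'a'], ['1', 'b'], ['1', 'c'],
        ['2', 'd'], ['2', 'e'],
        ['3', 'a'], ['3', 'd'], ['3', 'e'], ['3', 'f']]

# Turn each group into a list right away: a group iterator is no longer
# usable once groupby has advanced to the next key.
grouped = [(key, list(group))
           for key, group in itertools.groupby(rows, key=operator.itemgetter(0))]

for key, group in grouped:
    print(key, group)
```

Each printed line shows one key followed by the full rows that share it; the key itself is still present inside every row at this stage.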

operator.itemgetter does the same thing as the [] syntax — gets the item specified. operator.itemgetter(0) is the same as:

def itemgetter_0(seq_or_mapping):
    return seq_or_mapping[0]

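To extract the values and create lists, you can write the following (in Python 3, map returns an iterator, so it has to be wrapped in list() before it can be concatenated to [key]):

output = [[key] + list(map(operator.itemgetter(1), group)) for key, group in groups]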

which starts each list with the key and then extracts the second item from each line and adds them to the list.

For your example input, the output will be:

[['1', 'a', 'b', 'c'], ['2', 'd', 'e'], ['3', 'a', 'd', 'e', 'f']]
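Putting the pieces together, a minimal end-to-end sketch for Python 3 might look like the following. The helper name `group_rows` is hypothetical, and io.StringIO objects stand in for the book1.csv and output.csv files from the question so that the example runs on its own. It uses a list comprehension in place of map, which is a stylistic variation rather than a different technique.

```python
import csv
import io
import itertools
import operator


def group_rows(rows):
    """Collapse consecutive rows sharing a first column into [key, v1, v2, ...]."""
    return [[key] + [row[1] for row in group]
            for key, group in itertools.groupby(rows, key=operator.itemgetter(0))]


# Stand-in for open('book1.csv', newline=''); csv.reader accepts any iterable of lines.
sample = io.StringIO("1,a\n1,b\n1,c\n2,d\n2,e\n3,a\n3,d\n3,e\n3,f\n")
result = group_rows(csv.reader(sample))

# Stand-in for open('output.csv', 'w', newline='').
out = io.StringIO()
csv.writer(out).writerows(result)

for line in result:
    print(line)
```

Because every group is built in one pass, there is no placeholder 0 at the start of each list and the last group is emitted like any other, which covers the two problems raised in the question.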
