I imagine this is an easy one for a decent Python dev – Im

Question


Asked: June 10, 2026 (2026-06-10T10:33:02+00:00)


I imagine this is an easy one for a decent Python dev (I'm still learning!). Given a CSV with duplicate emails, I would like to iterate over it and write out how many times each email appears, e.g.:

infile.csv

COLUMN 0
some@email.com
some@email.com
another@address.com
example@email.com

outfile.csv

COLUMN 0                 COLUMN 1
some@email.com           2
another@address.com      1
example@email.com        1

So far I can remove duplicates with

import csv

f = csv.reader(open('infile.csv','rb'))
writer = csv.writer(open('outfile.csv','wb'))
emails = set()


for row in f:
    if row[0] not in emails:
        writer.writerow(row)
        emails.add( row[0] )

but I am having trouble writing the count of each email to a new column.


1 Answer

Editorial Team · Answer 1 · 2026-06-10T10:33:03+00:00


Use defaultdict from the collections module (available since Python 2.5). Count every email first, then write the results:

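from collections import defaultdict

# f and writer are the csv.reader / csv.writer objects from the question

# count all the emails before we write anything out
emails = defaultdict(int)  # missing keys start at 0
for row in f:
    emails[row[0]] += 1

# now write the file, one (email, count) pair per row
for row in emails.items():
    writer.writerow(row)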
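
For anyone on Python 3, here is a minimal self-contained sketch of the same count-then-write idea. It swaps defaultdict for collections.Counter, which is not what the answer above uses, and the function name count_emails is made up for illustration. It assumes the emails are in column 0 and that the file has no header row; if yours has one, it will be counted like any other value unless you skip it first.

```python
import csv
from collections import Counter


def count_emails(in_path, out_path):
    """Write each distinct value of column 0 alongside its occurrence count.

    count_emails is a hypothetical helper name, not part of any library.
    """
    # newline="" is the mode the csv module documentation recommends in Python 3
    with open(in_path, newline="") as src:
        # skip completely empty rows so row[0] cannot raise IndexError
        counts = Counter(row[0] for row in csv.reader(src) if row)

    with open(out_path, "w", newline="") as dst:
        # each (email, count) pair becomes one two-column row
        csv.writer(dst).writerows(counts.items())

    return counts
```

Calling count_emails('infile.csv', 'outfile.csv') should produce the two-column file from the question. On Python 3.7 and later the rows come out in order of first appearance, because Counter is a dict subclass and dicts keep insertion order; the Python 2 defaultdict version makes no such ordering guarantee.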
