I know sed or awk can tackle this kind of problem more elegantly perhaps.

Question

0

Asked: May 19, 20262026-05-19T09:19:48+00:00 2026-05-19T09:19:48+00:00

I know sed or awk can tackle this kind of problem more elegantly perhaps.

0

I know sed or awk can tackle this kind of problem more elegantly perhaps. But I went the python way, so the problem is that I would like to renumber the first column of my data file from 1 to #of lines in the file. Is that a good idea to read the file by readlines? For small files perhaps, but large files not I suppose. So here is what I came up as a first attempt, any comments are appreciated.

#!/usr/bin/env python

import sys

try:
    infilename = sys.argv[1]; outfilename = sys.argv[2];
except:
    print "Usage is <script> inFile outFile"

ifile = open(infilename,'r')
ofile = open(outfilename, 'w')

lines = ifile.readlines();

i=1
for line in lines: 
    list = line.split();
    list[0] = i
    i += 1 
    for val in list:
        ofile.write("%d " % int(val))
    ofile.write('\n')
    del list

ifile.close()
ofile.close()

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-19T09:19:49+00:00

You can iterate over the file to keep only the current line in memory:

#!/usr/bin/env python
import sys

try:
    # dont use ; !
    infilename = sys.argv[1]
    outfilename = sys.argv[2]
except:
    print "Usage is <script> inFile outFile"


# you could use `with` here if you have a Python 2.7
ifile = open(infilename,'r')
ofile = open(outfilename, 'w')

# no need to count yourself, enumerate does that
# plus when you iterate over a file you get lines too
for i, line in enumerate(ifile, start=1):
    # dont shadow builtins like `list`
    parts = line.split()
    parts[0] = i
    # join is the inverse function to split
    new_line = ' '.join("%d" % int(val) for val in parts)
    ofile.write(new_line + '\n')

ifile.close()
ofile.close()

@Umut Tabak: ("%d" % int(val) for val in parts) is a generator expression, they are kind of like lazy lists. It gives the same items as the list comprehension ["%d" % int(val) for val in parts] but without actually creating the list.

Btw, the for block can be written even shorter, but it’s slightly different because it doesn’t enforce that all lines are ints anymore:

for i, line in enumerate(ifile, start=1):
    parts = line.split()
    parts[0] = "%d" % i
    new_line = ' '.join(parts)
    ofile.write(new_line + '\n')

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I know sed or awk can tackle this kind of problem more elegantly perhaps.

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply