I am trying to use zipfile module to read a file in an archive.

Asked: June 7, 2026 (2026-06-07T15:23:41+00:00)

I am trying to use the zipfile module to read a file in an archive. The uncompressed file is ~3 GB and the compressed file is 200 MB. I don’t want either of them held in memory, since I process the file line by line. So far I have noticed excessive memory use with the following code:

import zipfile
f = open(...)
z = zipfile.ZipFile(f)
for line in z.open(...).readlines():
    print(line)

I did this in C# using SharpZipLib:

var fStream = File.OpenRead("...");
var unzipper = new ICSharpCode.SharpZipLib.Zip.ZipFile(fStream);
var dataStream =  unzipper.GetInputStream(0);

Here dataStream is a stream of the uncompressed data. I can’t seem to find a way to do the same in Python. Any help would be appreciated.


1 Answer

Editorial Team · Answer 1 · 2026-06-07T15:23:43+00:00

Python file objects are iterable, and iterating over one reads it line by line. file.readlines() reads every line and returns a list, which means the whole file has to be held in memory. The better approach (which should always be preferred over readlines()) is to loop over the file object itself, e.g.:

import zipfile
with zipfile.ZipFile(...) as z:
    with z.open(...) as f:
        # Iterating over f reads one line at a time
        # (in Python 3 each line is a bytes object)
        for line in f:
            print(line)

Note my use of the with statement: file objects are context managers, and the with statement lets us write readable code that ensures files are closed when the block is exited (even if an exception is raised). Again, this should always be used when dealing with files.
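
As a rough sketch of the same idea wrapped up for reuse in Python 3, the generator below streams one archive member and decodes it to text as it goes. The function name iter_zip_lines, its parameters, and the UTF-8 default are assumptions made for illustration; they are not part of the zipfile module or of the question.

```python
import io
import zipfile


def iter_zip_lines(archive_path, member_name, encoding="utf-8"):
    """Yield decoded lines from one archive member, one at a time.

    Hypothetical helper: the name and the utf-8 default are assumptions.
    """
    with zipfile.ZipFile(archive_path) as archive:
        # ZipFile.open returns a binary file-like object that
        # decompresses the member as it is read.
        with archive.open(member_name) as raw:
            # TextIOWrapper decodes the bytes to str incrementally.
            with io.TextIOWrapper(raw, encoding=encoding) as text:
                for line in text:
                    yield line.rstrip("\n")
```

Usage would look something like `for line in iter_zip_lines("archive.zip", "data.txt"): ...`, where both names are placeholders. Because it is a generator, only the current line and the reader's internal buffers should be in memory at any point, not the full uncompressed file.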
