It’s taking me up to an hour to read a 1-gigabyte NetworkX graph data

Question

0

Asked: May 14, 20262026-05-14T15:34:05+00:00 2026-05-14T15:34:05+00:00

It’s taking me up to an hour to read a 1-gigabyte NetworkX graph data

0

It’s taking me up to an hour to read a 1-gigabyte NetworkX graph data structure using cPickle (its 1-GB when stored on disk as a binary pickle file).

Note that the file quickly loads into memory. In other words, if I run:

import cPickle as pickle

f = open("bigNetworkXGraph.pickle","rb")
binary_data = f.read() # This part doesn't take long
graph = pickle.loads(binary_data) # This takes ages

How can I speed this last operation up?

Note that I have tried pickling the data both in using both binary protocols (1 and 2), and it doesn’t seem to make much difference which protocol I use. Also note that although I am using the “loads” (meaning “load string”) function above, it is loading binary data, not ascii-data.

I have 128gb of RAM on the system I’m using, so I’m hoping that somebody will tell me how to increase some read buffer buried in the pickle implementation.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-14T15:34:06+00:00

Editorial Team

2026-05-14T15:34:06+00:00Added an answer on May 14, 2026 at 3:34 pm

You’re probably bound by Python object creation/allocation overhead, not the unpickling itself.
If so, there is little you can do to speed this up, except not creating all the objects. Do you need the entire structure at once? If not, you could use lazy population of the data structure (for example: represent parts of the structure by pickled strings, then unpickle them only when they are accessed).

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

It’s taking me up to an hour to read a 1-gigabyte NetworkX graph data

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply