Suppose I have a csv file with 400 columns. I cannot load the entire

Question

0

Asked: June 13, 20262026-06-13T21:50:50+00:00 2026-06-13T21:50:50+00:00

Suppose I have a csv file with 400 columns. I cannot load the entire

0

Suppose I have a csv file with 400 columns. I cannot load the entire file into a DataFrame (won’t fit in memory). However, I only really want 50 columns, and this will fit in memory. I don’t see any built in Pandas way to do this. What do you suggest? I’m open to using the PyTables interface, or pandas.io.sql.

The best-case scenario would be a function like: pandas.read_csv(...., columns=['name', 'age',...,'income']). I.e. we pass a list of column names (or numbers) that will be loaded.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-13T21:50:52+00:00

Editorial Team

2026-06-13T21:50:52+00:00Added an answer on June 13, 2026 at 9:50 pm

There’s no default way to do this right now. I would suggest chunking the file and iterating over it and discarding the columns you don’t want.
So something like pd.concat([x.ix[:, cols_to_keep] for x in pd.read_csv(..., chunksize=200)])

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

Suppose I have a csv file with 400 columns. I cannot load the entire

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply