I have a big array ( 1000x500000x6 ) that is stored in a pyTables

Question

Asked: June 4, 2026

I have a big array (1000x500000x6) stored in a PyTables file. The calculations I run on it are fairly well optimized for speed, but slicing the array is what takes the most time.

At the beginning of the script, I need to get a subset of the rows: reduced_data = data[row_indices, :, :]. Then, for this reduced dataset, I need to access:

columns one by one: reduced_data[:,clm_indice,:]
a subset of the columns: reduced_data[:,clm_indices,:]

Getting these arrays takes forever. Is there any way to speed that up, for example by storing the data differently?

1 Answer

Editorial Team · Answer 1 · 2026-06-04T04:25:10+00:00

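You can try choosing the chunkshape of your array wisely, see: http://pytables.github.com/usersguide/libref.html#tables.File.createCArray (in PyTables 3.x the method is named create_carray).
The chunkshape sets the shape of the blocks (chunks) in which the data is physically stored in the file. Reads fetch whole chunks, so a chunkshape that matches your slicing pattern can speed up access.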
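To see why the chunkshape matters, here is a rough sketch in plain Python that counts how many chunks a single-column slice overlaps under two candidate layouts. The helper names chunks_touched and elements_read are hypothetical and are not part of PyTables. The row-wise layout (1, 500000, 6) is only an assumed comparison point, not necessarily how your file is chunked today.

```python
def chunks_touched(chunkshape, selection):
    """Count the chunks overlapped by a selection.

    selection holds one (start, stop) pair per axis, stop exclusive.
    Hypothetical helper for illustration only, not a PyTables function.
    """
    total = 1
    for size, (start, stop) in zip(chunkshape, selection):
        first = start // size          # index of the first chunk hit on this axis
        last = (stop - 1) // size      # index of the last chunk hit on this axis
        total *= last - first + 1
    return total


def elements_read(chunkshape, selection):
    """Elements pulled from disk, assuming every touched chunk is read in full."""
    per_chunk = 1
    for size in chunkshape:
        per_chunk *= size
    return per_chunk * chunks_touched(chunkshape, selection)


# The slice data[:, 42, :] on the (1000, 500000, 6) array from the question.
one_column = ((0, 1000), (42, 43), (0, 6))

column_chunks = chunks_touched((1000, 1, 6), one_column)   # one chunk per column
row_chunks = chunks_touched((1, 500000, 6), one_column)    # assumed row-wise layout

print(column_chunks, elements_read((1000, 1, 6), one_column))
print(row_chunks, elements_read((1, 500000, 6), one_column))
```

Under these assumptions the column-wise layout serves the slice from one chunk, while the row-wise layout has to touch one chunk per row. The trade-off runs the other way for data[row_indices, :, :], which would cross every column chunk.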

With some luck, for your data access pattern, something like chunkshape=(1000, 1, 6) might work: each chunk then holds all rows for a single column, so reading one column touches only one chunk.
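A minimal sketch of creating such an array, assuming PyTables 3.x (create_carray) and float64 data. The file name, the node name "data", and the reduced shape are placeholders chosen so the example runs quickly; the real array would use shape (1000, 500000, 6).

```python
import os
import tempfile

import numpy as np
import tables

# Small stand-in shape; the array in the question is (1000, 500000, 6).
shape = (1000, 50, 6)
path = os.path.join(tempfile.mkdtemp(), "chunked_demo.h5")  # placeholder file name

with tables.open_file(path, mode="w") as h5:
    data = h5.create_carray(
        h5.root,
        "data",                      # placeholder node name
        atom=tables.Float64Atom(),   # assumes float64 values
        shape=shape,
        chunkshape=(1000, 1, 6),     # one chunk = all rows of one column
    )
    data[:, :, :] = np.random.rand(*shape)

with tables.open_file(path, mode="r") as h5:
    data = h5.root.data
    stored_chunkshape = data.chunkshape

    # A single column: served from a single chunk.
    one_column = data[:, 3, :]

    # A subset of columns, read one chunk at a time and stacked on axis 1.
    clm_indices = [3, 7, 11]
    some_columns = np.stack([data[:, c, :] for c in clm_indices], axis=1)

print(stored_chunkshape, one_column.shape, some_columns.shape)
```

Whether this layout is actually faster depends on the file and the machine, so it is worth timing both the column reads and the initial data[row_indices, :, :] step before converting the full dataset.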
