I am working with information from big models, which means I have a lot of big ascii files with two float columns (lets say X and Y). However, whenever I have to read these files it takes a long time, so I thought maybe converthing them to binary files will make the reading process much faster.
I converted my asciifiles into binary files using the uu.encode(ascii_file,binary_file) command, and it worked quite well (Actually, tested the decode part and I recovered the same files).
My question is: is there anyway to read the binary files directly into python and get the data into two variables (x and y)?
Thanks!
You didn’t specify how your float columns are represented in Python. The
cPicklemodule is a fast general solution, with the drawback that it creates files readable only from Python, and that it should never be allowed to read untrusted data (received from the network). It is likely to just work with all regular datatypes, including numpy arrays.If you can use numpy and store your data in numpy arrays, look into
numpy.saveandnumpy.savetxtand the corresponding loading functions, which should offer performance superior to manually extracting the data.array.arrayalso has methods for writing array data to file, with the drawback that the array data is written in the native format and cannot be read from a different architecture.