I am trying to calculate an initial buffer size to use when decompressing data of an unknown size. I have a bunch of data points from existing compression streams but don’t know the best way to analyze them.
Each data point is a compressed size and its ratio to the uncompressed size.
For example:
100,425 (compressed size) × 1.3413 (compression ratio) ≈ 134,700 (uncompressed size)
The compressed data stream doesn’t store the uncompressed size, so the decompressor has to allocate an initial buffer and realloc if it overflows. I’m looking for the “best” initial size to allocate given the compressed size. I have over 293,000 data points.
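For reference, the grow-on-overflow pattern I mean is roughly this (a minimal sketch; `try_decompress` is a hypothetical stand-in for whatever decompressor is in use):

```c
#include <stdlib.h>

/* Hypothetical decompressor: returns 0 on success,
 * nonzero if dst_cap is too small. */
int try_decompress(const unsigned char *src, size_t src_len,
                   unsigned char *dst, size_t dst_cap, size_t *out_len);

/* Decompress into a buffer that starts at `initial` bytes
 * and doubles on overflow. Returns NULL on allocation failure. */
unsigned char *decompress_auto(const unsigned char *src, size_t src_len,
                               size_t initial, size_t *out_len)
{
    size_t cap = initial;
    unsigned char *buf = malloc(cap);
    if (!buf) return NULL;

    while (try_decompress(src, src_len, buf, cap, out_len) != 0) {
        cap *= 2;                                /* grow and retry */
        unsigned char *tmp = realloc(buf, cap);
        if (!tmp) { free(buf); return NULL; }
        buf = tmp;
    }
    return buf;
}
```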
Given that you have a lot of data points on how your compression behaves, I’d recommend analyzing them to get a mean compression ratio and a standard deviation. Then set your initial buffer size to the compressed size multiplied by the ratio two standard deviations above the mean; assuming the ratios are roughly normally distributed, that buffer will be big enough for about 97.7% of your cases (only streams whose ratio falls more than 2σ above the mean will overflow it). If you want the buffer to avoid reallocation in more cases, increase the number of standard deviations above the mean that you allocate for, as in the sketch below.
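A minimal sketch of that calculation, assuming you have the observed ratios in an array (`ratio_at_sigmas` and `initial_buffer_size` are illustrative names, not an existing API):

```c
#include <math.h>
#include <stddef.h>

/* Mean plus k standard deviations of the observed compression ratios
 * (two-pass computation for numerical stability). */
double ratio_at_sigmas(const double *ratios, size_t n, double k)
{
    double mean = 0.0;
    for (size_t i = 0; i < n; i++)
        mean += ratios[i];
    mean /= (double)n;

    double var = 0.0;
    for (size_t i = 0; i < n; i++) {
        double d = ratios[i] - mean;
        var += d * d;
    }
    var /= (double)n;                 /* population variance */

    return mean + k * sqrt(var);
}

/* Initial buffer size for a new stream: compressed size times the
 * ratio two standard deviations above the mean, rounded up. */
size_t initial_buffer_size(size_t compressed_size,
                           const double *ratios, size_t n)
{
    return (size_t)((double)compressed_size
                    * ratio_at_sigmas(ratios, n, 2.0)) + 1;
}
```

Raising `k` from 2.0 to 3.0 would cover roughly 99.9% of cases under the same normality assumption, at the cost of over-allocating for typical streams; with 293,000 samples it’s worth checking a histogram of the ratios, since a long right tail would make the normal-based percentages optimistic.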