My problem is that I have a 100,000+ different elements and as I understand

Question

0

Asked: May 23, 20262026-05-23T01:36:28+00:00 2026-05-23T01:36:28+00:00

My problem is that I have a 100,000+ different elements and as I understand

0

My problem is that I have a 100,000+ different elements and as I understand it Huffman works by assigning the most common element a 0 code, and the next 10, the next 110, 1110, 11110 and so on. My question is, if the code for the nth element is n-bits long then surely once I have passed the 32nd term it is more space efficient to just sent 32-bit data types as they are, such as ints for example? Have I missed something in the methodology?

Many thanks for any help you can offer. My current implementation works by doing

code = (code << 1) + 2;

to generate each new code (which seems to be correct!), but the only way I could encode over 100,000 elements would be to have an int[] in a makeshift new data type, where to access the value we would read from the int array as one continuous long symbol… that’s not as space efficient as just transporting a 32-bit int? Or is it more a case of Huffmans use being with its prefix codes, and being able to determine each unique value in a continuous bit stream unambiguously?

Thanks

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-23T01:36:29+00:00

Editorial Team

2026-05-23T01:36:29+00:00Added an answer on May 23, 2026 at 1:36 am

Your understanding is a bit off – take a look at http://en.wikipedia.org/wiki/Huffman_coding. And you have to pack the encoded bits into machine words in order to get compression – Huffman encoded data can best be thought of as a bit-stream.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

My problem is that I have a 100,000+ different elements and as I understand

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply