I was thinking about compressing large blocks of text using most frequent english words,

Question

Asked: June 12, 2026

I was thinking about compressing large blocks of text using the most frequent English words, but now I doubt it would be efficient, since LZW seems to achieve just this in a better way.

Still, I can’t shake the feeling that compressing text character by character is a little “brutal”. One could analyze the structure of sentences to organize the text into smaller chunks of data, and where the structure would not come out exactly the same after decompression, fall back on classic compression methods.

Does “basic” NLP allow that?

1 Answer

Editorial Team · Answer 1 · 2026-06-12T20:07:31+00:00

NLP?

Standard compression techniques can be applied to words instead of characters. These techniques would assign probabilities to what the next word is, based on the preceding words. I have not seen this in practice though, since there are so many more words than characters, resulting in prohibitive memory usage and excessive execution time for even low-order models.
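
To make the idea concrete, here is a minimal sketch, not a real compressor. It assumes an adaptive order-1 word model (each word predicted from the one before it) with add-one smoothing, and assumes encoder and decoder already share the vocabulary. It only estimates how many bits an ideal arithmetic coder would spend under that model. The names `tokenize` and `ideal_code_length_bits` are made up for illustration and do not come from any library.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    # Lowercase, then split into words and single punctuation characters.
    return re.findall(r"\w+|[^\w\s]", text.lower())


def ideal_code_length_bits(tokens, vocab_size):
    """Estimate the bits an ideal arithmetic coder would need under an
    adaptive order-1 word model with add-one smoothing.

    Assumption: the vocabulary (and its size) is known to both sides.
    Returns the bit estimate and the context table that was built.
    """
    counts = defaultdict(Counter)  # previous word -> counts of next word
    totals = Counter()             # previous word -> number of times seen
    bits = 0.0
    prev = "<s>"                   # hypothetical start-of-text marker
    for tok in tokens:
        # Probability of this word given the previous one, before updating.
        p = (counts[prev][tok] + 1) / (totals[prev] + vocab_size)
        bits += -math.log2(p)
        # Update the model the same way a decoder would after decoding tok.
        counts[prev][tok] += 1
        totals[prev] += 1
        prev = tok
    return bits, counts


if __name__ == "__main__":
    text = "the cat sat on the mat. the cat sat on the hat."
    tokens = tokenize(text)
    bits, counts = ideal_code_length_bits(tokens, len(set(tokens)))
    print(f"{len(tokens)} tokens, about {bits:.1f} bits, {len(counts)} contexts")
```

The `counts` table holds one entry per distinct previous word, each with its own counter of following words. On real text that table grows with the vocabulary, which is the memory cost mentioned above, and it grows much faster for higher-order models that condition on more than one preceding word.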
