Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 3751642
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 19, 20262026-05-19T09:06:13+00:00 2026-05-19T09:06:13+00:00

I have 160 bits of random data. Just for fun, I want to generate

  • 0

I have 160 bits of random data.

Just for fun, I want to generate pseudo-English phrase to “store” this information in. I want to be able to recover this information from the phrase.

Note: This is not a security question, I don’t care if someone else will be able to recover the information or even detect that it is there or not.

Criteria for better phrases, from most important to the least:

  • Short
  • Unique
  • Natural-looking

The current approach, suggested here:

Take three lists of 1024 nouns, verbs and adjectives each (picking most popular ones). Generate a phrase by the following pattern, reading 20 bits for each word:

Noun verb adjective verb,
Noun verb adjective verb,
Noun verb adjective verb,
Noun verb adjective verb.

Now, this seems to be a good approach, but the phrase is a bit too long and a bit too dull.

I have found a corpus of words here (Part of Speech Database).

After some ad-hoc filtering, I calculated that this corpus contains, approximately

  • 50690 usable adjectives
  • 123585 nouns
  • 15301 verbs
  • 13010 adverbs (not included in pattern, but mentioned in answers)

This allows me to use up to

  • 16 bits per adjective (actually 16.9, but I can’t figure how to use fractional bits)
  • 15 bits per noun
  • 13 bits per verb
  • 13 bits per adverb

For noun-verb-adjective-verb pattern this gives 57 bits per “sentence” in phrase. This means that, if I’ll use all words I can get from this corpus, I can generate three sentences instead of four (160 / 57 ≈ 2.8).

Noun verb adjective verb,
Noun verb adjective verb,
Noun verb adjective verb.

Still a bit too long and dull.

Any hints how can I improve it?

What I see that I can try:

  • Try to compress my data somehow before encoding. But since the data is completely random, only some phrases would be shorter (and, I guess, not by much).

  • Improve phrase pattern, so it would look better.

  • Use several patterns, using the first word in phrase to somehow indicate for future decoding which pattern was used. (For example, use the last letter or even the length of the word.) Pick pattern according to the first bytes of the data.

…I’m not that good with English to come up with better phrase patterns. Any suggestions?

  • Use more linguistics in the pattern. Different tenses etc.

…I guess, I would need much better word corpus than I have now for that. Any hints where can I get a suitable one?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-19T09:06:14+00:00Added an answer on May 19, 2026 at 9:06 am

    I would consider adding adverbs to your list. Here is a pattern I came up with:

    <Adverb>, the
        <adverb> <adjective>, <adverb> <adjective> <noun> and the
        <adverb> <adjective>, <adverb> <adjective> <noun>
    <verb> <adverb> over the <adverb> <adjective> <noun>.
    

    This can encode 181 bits of data. I derived this figure using lists I made a while back from WordNet data (probably a bit off because I included compound words):

    • 12650 usable nouns (13.6 bits/noun, rounded down)
    • 5247 usable adjectives (12.3 bits/adjective)
    • 5009 usable verbs (12.2 bits/verb)
    • 1512 usable adverbs (10.5 bits/adverb)

    Example sentence: “Soaking, the habitually goofy, socially speculative swatch and the fearlessly cataclysmic, somewhere reciprocal macrocosm foreclose angelically over the unavoidably intermittent comforter.”

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I need a different way to have global access to 160*160 bits of data,
I have this data set: x <- c(0, 40, 80, 120, 160, 200) y
So I have this string: (a url) example.html/#playYouTubeVideo=id[lBs8jPDPveg]&width[160]&height[90] I want to match id, width
I have the following xml document: <?xml version=1.0 encoding=UTF-8?> <root> <data> <child1>&#160;Well, some spaces
i have this table **ID CityCode Price** 1 ABB 100 2 CAB 160 3
I have a block of memory allocated 20bytes(160-bits) with memset value of 1. Each
I have this source XML: <?xml version=1.0?> <root> <item1> <name>test</name> <price>160</price> <stock>4</stock> <country>Belgium</country> </item1>
I have an ASP.NET application that relies on the Random class to generate a
I have around 1500 bytes of data that I want to construct a checksum
I have this request INSERT INTO SERVICEPAYANT_CLIENT (RE_ID, TYPE_DONNEES) SELECT CLIENT_ID, 160 AS TYPE_DONNEES

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.