Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7530265
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 30, 20262026-05-30T04:51:53+00:00 2026-05-30T04:51:53+00:00

I have a large database containing words and their inflected forms , e.g.: BASIC_FORM

  • 0

I have a large database containing words and their inflected forms, e.g.:

BASIC_FORM ##### INFLECED_FORM

talk ----- talk
talk ----- talking
talk ----- talked
talk ----- talks
paragraph ----- paragraph
paragraph ----- paragraphs
...

This database requires a lot of disk space, of course, as soon as it has 1 million entries or more.

What is the best method to “compress” that set of data, i.e. reduce the required amount of disk space while no information is lost?

My first idea was to create an extra column which holds the number of characters that can be copied from the beginning of the basic form. Then you just have to save the part of the inflected form that differs, e.g.:

BASIC_FORM ##### NUM_EQUAL ##### INFLECED_FORM

talk ----- 4 ----- 
talk ----- 4 ----- ing
talk ----- 4 ----- ed
talk ----- 4 ----- s
try ----- 3 ----- 
try ----- 2 ----- ied
paragraph ----- 9 ----- 
paragraph ----- 9 ----- s
...

This should save some amount of disk space as “NUM_EQUAL” can be saved as TINYINT in MySQL (for example) so it requires only 1 byte and in the string “INFLECTED_FORM” you usually save more than 1 character (i.e. more than 1 byte).

Do you have other suggestions to save disk space?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-30T04:51:55+00:00Added an answer on May 30, 2026 at 4:51 am

    You should normalize the model. That means, create a separate table for the basic_form. I’m not sure how much space you will save because that way because that will depend on the data (the longer the words you have and the more inflections you have, the more space you’ll save). However, let’s say you only have one word and one inflected word for each (I know that’s not the case, but let’s take it to that extreme), then having two tables would increase the storage needed.

    Now, after aplying the previous refactor (that will also save you some headaches, as normalization always do!) you can also apply YOUR system for reducing the size it takes to store the inlections too.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

We have a large database containing a lot of stored procedures. Now we are
I have data database containing some rather large strings, each of which holds a
I have a MYSQL database containing the names of a large collection of people.
I have a fairly large database containing a number of different tables representing different
I have large database table, approximately 5GB, now I wan to getCurrentSnapshot of Database
I have a large database and would like to select table names that have
I have a large database of normalized order data that is becoming very slow
I have a large database of users (~200,000) that I'm transferring from a ASP.NET
I have a large database table that I need to display on a Windows
I have a large database and am putting together a report of the data.

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.