The Archive Base

Editorial Team
Asked: May 10, 2026 (2026-05-10T18:38:01+00:00)

I have a MySQL database table with a couple thousand rows. The table is set up like so:

id | text

The id column is an auto-incrementing integer, and the text column is a 200-character varchar.

Say I have the following rows:

3 | I think I'll have duck tonight

4 | Maybe the chicken will be alright

5 | I have a pet duck now, awesome!

6 | I love duck

Then the list I’m wanting to generate might be something like:

  • 3 occurrences of ‘duck’
  • 3 occurrences of ‘I’
  • 2 occurrences of ‘have’
  • 1 occurrence of ‘chicken’
  • etc.

I’ll also probably want to maintain a list of substrings to exclude from the results, such as ‘I’, ‘will’ and ‘have’. Note that I do not know in advance what people will post.

I do not have a list of words that I want to monitor; I just want to find the most common substrings. I’ll then manually filter out any uninteresting substrings by editing the query.

Can anyone suggest the best way to do this? Thanks everyone!

1 Answer

  1. Added an answer on May 10, 2026 at 6:38 pm

    MySQL already does this for you.

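    First, make sure your table is a MyISAM table (ALTER TABLE yourtablename ENGINE=MyISAM; converts it if it is not).

    Then define a FULLTEXT index on your text column, for example: ALTER TABLE yourtablename ADD FULLTEXT (`text`);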

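    On a shell command line, navigate to the folder where your MySQL data is stored, then type the command below. The trailing 1 is the index number; indexes are numbered from 0, so 1 assumes the FULLTEXT index is the table's second index, after the primary key.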

    myisam_ftdump -c yourtablename 1 >wordfreq.dump 

    Each line of wordfreq.dump should hold a count, a global weight, and the word. You can then process the file to drop the unwanted weight column and sort by count, descending.

    You could do all the above with a single command line and some sed/awk wizardry no doubt. And you could incorporate it into your program without needing a dump file.
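As a rough sketch of incorporating this into a program, the Python below parses lines in the three-column shape that myisam_ftdump -c is expected to print (count, global weight, word) and returns the words sorted by count, skipping an ignore list. The function name parse_ftdump_counts, its ignore parameter, and the sample lines are made up for illustration. The exact column spacing of the real output is an assumption, so the parser relies only on whitespace-separated fields.

```python
def parse_ftdump_counts(lines, ignore=frozenset()):
    """Turn 'count weight word' lines into (word, count) pairs, most frequent first."""
    ignored = {word.lower() for word in ignore}
    results = []
    for line in lines:
        parts = line.split(None, 2)
        if len(parts) != 3:
            continue  # skip blank lines or anything not in the three-column shape
        count_text, _weight, word = parts
        word = word.strip()
        if not count_text.isdigit() or word.lower() in ignored:
            continue
        results.append((word, int(count_text)))
    # Sort by count descending, then alphabetically so ties are stable.
    results.sort(key=lambda pair: (-pair[1], pair[0]))
    return results


# Invented sample lines for illustration only, not real tool output.
sample = """\
        3            0.0000000 duck
        1            1.0986123 chicken
        1            1.0986123 awesome
"""

for word, count in parse_ftdump_counts(sample.splitlines()):
    print(count, word)
```

In a real program the lines could come from running the command with subprocess.run(["myisam_ftdump", "-c", "yourtablename", "1"], capture_output=True, text=True) from the data directory, instead of from the sample string.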

    More info on myisam_ftdump here: http://dev.mysql.com/doc/refman/5.0/en/myisam-ftdump.html

    One more thing: MySQL's default stopword list is compiled into the server, and by default words with 3 or fewer characters are not indexed. The full list is here:

    http://dev.mysql.com/doc/refman/5.0/en/fulltext-stopwords.html

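    If this list isn’t adequate for your needs, or you need words with 3 or fewer characters to count, you do not have to recompile MySQL. Point the ft_stopword_file system variable at your own stopword file and lower ft_min_word_len, then restart the server and rebuild the FULLTEXT index (REPAIR TABLE yourtablename QUICK; is enough for a MyISAM table).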
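If you would rather keep the ignore list and the minimum word length under your own control, the counting can also be done client-side. This is a different technique from the FULLTEXT dump above, not something MySQL provides. The sketch below is a minimal Python version that assumes the text values have already been fetched from the table; count_words, ignore, and min_length are hypothetical names, and words are lowercased, so ‘I’ is reported as ‘i’.

```python
import re
from collections import Counter

# A word is a run of letters, optionally with inner apostrophes (e.g. "i'll").
WORD_PATTERN = re.compile(r"[a-z]+(?:'[a-z]+)*")


def count_words(texts, ignore=frozenset(), min_length=1):
    """Count lowercased words across all texts, skipping ignored or too-short words."""
    ignored = {word.lower() for word in ignore}
    counts = Counter()
    for text in texts:
        for word in WORD_PATTERN.findall(text.lower()):
            if len(word) >= min_length and word not in ignored:
                counts[word] += 1
    return counts


# The example rows from the question.
rows = [
    "I think I'll have duck tonight",
    "Maybe the chicken will be alright",
    "I have a pet duck now, awesome!",
    "I love duck",
]

for word, count in count_words(rows, ignore={"I", "will", "have"}).most_common(5):
    print(count, word)
```

For a couple thousand short rows this fits comfortably in memory, and editing the ignore set takes the place of editing the query.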
