Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6346253
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 24, 20262026-05-24T20:58:58+00:00 2026-05-24T20:58:58+00:00

I am working on a feature and could use opinions on which database I

  • 0

I am working on a feature and could use opinions on which database I should use to solve this problem.

We have a Rails application using MySQL. We have no issues with MySQL and it runs great. But for a new feature, we are deciding whether to stay MySQL or not. To simplify the problem, let’s assume there is a User and Message model. A user can create messages. The message is delivered to other users based on their association with the poster.

Obviously there is an association based on friendship but there are many many more associations based on the user’s profile. I plan to store some metadata about the poster along with the message. This way I don’t have to pull the metadata each time when I query the messages.

Therefore, a message might look like this:

{
  id: 1,
  message: "Hi",
  created_at: 1234567890,
  metadata: {
    user_id: 555,
    category_1: null,
    category_2: null,
    category_3: null,
    ...
  }
}

When I query the messages, I need to be able to query based on zero or more metadata attributes. This call needs to be fast and occurs very often.

Due to the number of metadata attributes and the fact any number can be included in a query, creating SQL indexes here doesn’t seem like a good idea.

Personally, I have experience with MySQL and MongoDB. I’ve started research on Cassandra, HBase, Riak and CouchDB. I could use some help from people who might have done the research as to which database is the right one for my task.

And yes, the messages table can easily grow into millions or rows.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-24T20:58:58+00:00Added an answer on May 24, 2026 at 8:58 pm

    This is a very open ended question, so all we can do is give advice based on experience. The first thing to consider is if it’s a good idea to decide on using something you haven’t used before, instead of using MySQL, which you are familiar with. It’s boring not to use shiny new things when you have the opportunity, but believe me that it’s terrible when you’ve painted yourself in a corner because you though that the new toy would do everything it said on the box. Nothing ever works the way it says in the blog posts.

    I mostly have experience with MongoDB. It’s a terrible choice unless you want to spend a lot of time trying different things and realizing they don’t work. Once you scale up a bit you basically can’t use things like secondary indexes, updates, and other things that make Mongo an otherwise awesomely nice tool (most of this has to do with its global write lock and the database format on disk, it basically sucks at concurrency and fragments really easily if you remove data).

    I don’t agree that HBase is out of the question, it doesn’t have secondary indexes, but you can’t use those anyway once you get above a certain traffic load. The same goes for Cassandra (which is easier to deploy and work with than HBase). Basically you will have to implement your own indexing which ever solution you choose.

    What you should consider is things like if you need consistency over availability, or vice versa (e.g. how bad is it if a message is lost or delayed vs. how bad is it if a user can’t post or read a message), or if you will do updates to your data (e.g. data in Riak is an opaque blob, to change it you need to read it and write it back, in Cassandra, HBase and MongoDB you can add and remove properties without first reading the object). Ease of use is also an important factor, and Mongo is certainly easy to use from the programmer’s perspective, and HBase is horrible, but just spend some time making your own library that encapsulates the nasty stuff, it will be worth it.

    Finally, don’t listen to me, try them out and see how they perform and how it feels. Make sure you try to load it as hard as you can, and make sure you test everything you will do. I’ve made the mistake of not testing what happens when you remove lots of data in MongoDB, and have paid for that dearly.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm working on a site, which needs to have a feature of users capturing
I've been working on this problem for 7 hours now, and I still have
In Git I can do this: 1. Start working on new feature: $ git
I'm working on a JavaScript image resizing feature which rely on the IE only
I am working on wireframe for new feature in our application. One of the
I'm working on a search feature for my application, I want to search all
If I am working in multiple branches on a single feature, I use git
I'm working on what should (hopefully) be a small feature for a .NET software
Django newbie at work and I could use some pointers. I'm using django-profile and
I am working on an application and one feature that would make it really

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.