Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 7811843
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 2, 20262026-06-02T04:18:51+00:00 2026-06-02T04:18:51+00:00

We are looking to build an internal real-time analytics system using MongoDB or HBase

  • 0

We are looking to build an internal real-time analytics system using MongoDB or HBase (other options possible, but these are the two we are looking at right now.) We picked these two because:

  • We will be recording a huge number of events, and want to be able to
    horizontally and vertically scale.
  • We will get “long tail” requests – for example, “give me the metrics for this item three years ago” – and hence need to do real-time queries over a large range, whether time or other values.

We have tried to use MySQL for the analytics tables, but the tables get large really fast and we don’t want our analytics tables to be a “dead weight” on the rest of our MySQL system. (I am open to advice on using MySQL or other solutions for analytics tables if anyone has insight.)

So, the question is: we will often have to do real-time lookups and JOINs across both our analytics data store and MongoDB/Hbase. For instance, we will be storing the “id” of a table in analytics, but for any request, we will have to “join” it against, say, the “title” and “content” fields of a “content” table in MySQL.

Is there any way to do JOINs across MongoDB/HBase and MySQL, or is this something we would have to custom-build into our application? And if we build it in the application, how would we deal with custom JOINs on large data sets across MySQL and MongoDB/HBase?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-02T04:18:52+00:00Added an answer on June 2, 2026 at 4:18 am

    Is there any way to do JOINs across MongoDB/HBase and MySQL…

    MongoDB has absolutely zero support for JOINs. You would basically have to build your own join system if you use MongoDB

    HBase is build over top of Hadoop and, in theory, some JOIN-like syntax is possible.

    However, joining between MySQL and HBase is almost certainly custom code. You may want to consider copying the MySQL data into HBase.

    … need to do real-time queries over a large range… “give me the metrics for this item three years ago”

    Recognize that this is going to be a tremendous challenge and that you are unlikely to find anything “out of the box” that does this for you.

    You will need to be very judicious about the term “real-time”. It sounds like you want “relatively responsive queries” rather than “real-time queries”. You will have to scope “relatively responsive” as this will massively affect both the cost and the products you use.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm looking to build a program that works within soft real-time schedules; to do
We looking to build an application to maintain meeting minutes and similar stuff using
I'm looking to build a Rails app to do internal reporting (make charts or
I'm trying to build a Quicksilver style search system for the internal web app
I am looking in a way to build an internal search engine that gonna
I'm trying to build a streaming solution for an internal app, but am drawing
Im looking to build a thread manager for an application. I have already started
I'm looking to build an reusable control or custom helper for my MVC project.
I'm looking to build a VM into a game and was wondering if anyone
I'm looking to build a query that will use the non-clustered indexing plan on

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.