The Archive Base Latest Questions

Editorial Team
Asked: June 9, 2026 at 5:45 pm

I am currently researching what database to use for a project I am working on. Hopefully you guys can give me some hints.

The project is an automated web crawler that checks websites as per a user’s request, scrapes data under certain circumstances, and creates log files of what was done.

Requirements:

  • Only a few tables with few columns; predefining columns is no problem
  • No overly complex associations between models
  • A huge number of date- and time-based queries
  • Due to logging, the database will grow rapidly and use up a lot of space
  • Should be able to scale over multiple servers
  • Fields contain mostly ids (int), strings (around 200-500 characters max), and unix timestamps
  • Two different types of servers will simultaneously read/write data directly to/from it:
    • One (later more) Rails app that takes user input and displays results upon request
    • One (later more) Node.js server that functions as the executing crawler/scraper. It will have enough load to run continuously and will make dozens of database queries every second.

I assume it will be neither a graph database (no complex associations) nor a memory-based key/value store (too much data to hold in cache). I’m still on the fence about every other type of database I could find; each seems to have its merits.

So, any advice from the pros on how I should decide?


1 Answer

  1. Editorial Team
    Added an answer on June 9, 2026 at 5:45 pm

    I would agree with Vladimir that you should consider a document-based database for this scenario. I am most familiar with MongoDB, and my reasons for suggesting it here are as follows:

    1. Your ‘schema requirements’ of “only a few tables with few columns” fit well with the NoSQL nature of MongoDB.
    2. The same goes for “no overly complex associations between models”: you will want to decide whether you’d prefer nested documents or references via DBRef (I prefer the former).
    3. Huge amount of time-based data (and other scaling requirements): MongoDB scales horizontally via sharding, which partitions a collection across multiple servers.
    4. Read/write access: this is why I am recommending MongoDB over something like Hadoop. Hadoop-style storage is designed for batch processing rather than interactive queries, so it is a poor fit for applications that read and write continuously.
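To make points 2 and 3 a little more concrete, here is a minimal sketch of what a crawl-log document with a nested sub-document and a time-range query filter could look like. It only builds plain Python dictionaries, so it runs without a database. The field names (`site_id`, `ts`, `message`, `request`) and both helper functions are hypothetical and chosen purely for illustration; the `$gte` and `$lt` operators are standard MongoDB query operators.

```python
from datetime import datetime, timedelta, timezone


def make_log_entry(site_id: int, url: str, message: str, when: datetime) -> dict:
    """Build one crawl-log document with the request details nested inside it."""
    return {
        "site_id": site_id,
        "ts": when,  # hypothetical timestamp field used for time-based queries
        "message": message,
        # Embedded sub-document instead of a reference to another collection.
        "request": {"url": url, "method": "GET"},
    }


def time_range_filter(site_id: int, start: datetime, end: datetime) -> dict:
    """Build a MongoDB query filter for one site's entries in [start, end)."""
    return {"site_id": site_id, "ts": {"$gte": start, "$lt": end}}


# Compound index specification as (field, direction) pairs; 1 means ascending.
LOG_INDEX = [("site_id", 1), ("ts", 1)]

now = datetime(2026, 6, 9, 17, 45, tzinfo=timezone.utc)
entry = make_log_entry(42, "https://example.com/", "fetched page", now)
query = time_range_filter(42, now - timedelta(hours=1), now + timedelta(hours=1))
```

With a driver such as PyMongo, these objects could be passed to `collection.insert_one(entry)`, `collection.find(query)`, and `collection.create_index(LOG_INDEX)`. The filter document has the same shape in other drivers, including the Node.js one, so both the Rails app and the crawler could share this layout. Whether `site_id` plus `ts` is a good index or shard key depends on your actual query patterns, so treat it as a starting point rather than a recommendation.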
