Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 4587826
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 21, 20262026-05-21T21:50:09+00:00 2026-05-21T21:50:09+00:00

I am working on a project, where we are trying to introduce a searchframework.

  • 0

I am working on a project, where we are trying to introduce a searchframework. We are about to start development soon, we have only done some poc-work up till now. We are struggling with estimatesfor hardware. I am uncertain if our performance requirements can be met using a single server setup, or if we need to go for a replicated, or distrbuted solution.

Here are our main requirements

  • Search in semi-structured data
    • Documents contains 15 fields all of whom should be searchable
    • Mostly numeric id’s
    • Dates
    • Names
  • 10+ millions documents in index
  • 30-40 updates, in batches every minute
  • <100 ms response time searches with several boolean operators for 100 + queries pr minute

Questions

1) Is it feasible to get this performance on a singleserver setup?

2) If not what is an appropriate setup to meet the performance requirements.

3) We are considering several frameworks on top of Lucene, amongst them Solr and Zoie. What distributed architecture would be necessary to handle the descibed load and performance requirements.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-21T21:50:09+00:00Added an answer on May 21, 2026 at 9:50 pm

    1) Is it feasible to get this performance on a singleserver setup?

    Yes, I think so. But it’s a kind of “borderline” (I hope you know, what I mean)
    What you need is enough RAM and CPU power. Finlay it depends on the size of “big” fileds, like fulltexte or so and the size of your database.

    In comparison I use lucene with 1.2 million docs, 7 fileds, mostly short fileds (date,numbers,..) but also including one big textfield (500-5000 characters). The size of this mysql database (which is indexed by lucene) is 1-2 GB. The System runs on an small single CPU VMware Host with 4GB of RAM. The Fulltext-Search results returned in 100-400ms.
    If you don’t have big textfields, your results will return faster. (depending on the kind of search -> for example facettet search)
    For example: an facetet search on an char(255) Filed, returned in <70ms

    Probably for your configuration an non visualized Hardware with lots of memory (>32GB) and >8 cores would be useful.

    30-40 updates, in batches every minute

    does it mean 30-40 new documents per minute? that’s no problem!
    30-40 updates per minute with lots of new documents would be more challenging.
    Additional you should optimize your index periodically (for example nightly)

    3) We are considering several frameworks on top of Lucene, amongst them Solr and Zoie.

    Solr is running as an tomcat application. Here you have to define for example the RAM (look above), which is assigned to your search engine.
    There are different possibilities to split your index (for more performance or faster update), clustering is also possible.

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I have been working on a project and trying to find the source of
I've got a django project working on a development server(ubuntu) that we have been
I have been trying to get xapian working django haystack for a project im
I am working on an Actionscript 2 project - trying to use the XML
I am working on rails project and I am trying to get exceptions to
The project I am working on were are trying to come up with a
While working on an existing project I suddenly got the following error when trying
I am trying to setup tracd for the project I am currently working on.
I'm trying to automate the build of the project I'm working on. My ultimate
I'm trying to learn GNUMake for a small project I'm working on. So far,

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.