The Archive Base


Editorial Team
Asked: May 21, 2026

I am working on a project where we are trying to introduce a search framework.


I am working on a project where we are trying to introduce a search framework. We are about to start development; so far we have only done some proof-of-concept work. We are struggling with hardware estimates. I am uncertain whether our performance requirements can be met with a single-server setup, or whether we need a replicated or distributed solution.

Here are our main requirements:

  • Search in semi-structured data
    • Documents contain 15 fields, all of which should be searchable
    • Mostly numeric IDs
    • Dates
    • Names
  • 10+ million documents in the index
  • 30-40 updates, in batches every minute
  • <100 ms response time for searches with several boolean operators, at 100+ queries per minute

Questions

1) Is it feasible to get this performance on a single-server setup?

2) If not, what is an appropriate setup to meet the performance requirements?

3) We are considering several frameworks on top of Lucene, among them Solr and Zoie. What distributed architecture would be necessary to handle the described load and performance requirements?

1 Answer

  1. Editorial Team
    Added an answer on May 21, 2026 at 9:50 pm

    1) Is it feasible to get this performance on a single-server setup?

    Yes, I think so, but it is a borderline case.
    What you need is enough RAM and CPU power. In the end it depends on the size of the “big” fields, such as full-text fields, and on the size of your database.

    For comparison, I use Lucene with 1.2 million docs and 7 fields, mostly short ones (dates, numbers, ...) but also one big text field (500-5000 characters). The MySQL database that Lucene indexes is 1-2 GB. The system runs on a small single-CPU VMware host with 4 GB of RAM. Full-text searches return in 100-400 ms.
    If you don’t have big text fields, your results will come back faster, depending on the kind of search (a faceted search, for example).
    For example, a faceted search on a char(255) field returned in <70 ms.

    For your configuration, non-virtualized hardware with lots of memory (>32 GB) and >8 cores would probably be useful.
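As a rough sanity check on the query side, the stated load of 100 queries per minute can be turned into an average concurrency figure. The sketch below assumes queries arrive evenly and each takes the full 100 ms budget; real traffic is bursty, so treat the result as a lower bound, not a sizing rule. The function name `query_load` is made up for this illustration.

```python
def query_load(queries_per_minute: float, latency_seconds: float) -> tuple[float, float]:
    """Return (queries per second, average number of queries in flight)."""
    qps = queries_per_minute / 60.0
    # Little's law: average concurrency = arrival rate * time spent in the system.
    in_flight = qps * latency_seconds
    return qps, in_flight


if __name__ == "__main__":
    # Figures from the question: 100 queries per minute, 100 ms per query.
    qps, in_flight = query_load(queries_per_minute=100, latency_seconds=0.100)
    print(f"{qps:.2f} queries/s, {in_flight:.2f} queries in flight on average")
```

Under these assumptions the query volume itself is small; the harder constraint is keeping every individual query under 100 ms while updates are being applied.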

    30-40 updates, in batches every minute

    Does that mean 30-40 new documents per minute? That’s no problem!
    30-40 batch updates per minute, each with lots of new documents, would be more challenging.
    Additionally, you should optimize your index periodically (for example, nightly).
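The difference between the two readings is easy to see with some arithmetic. This is only a sketch: the batch size of 500 documents is a hypothetical number chosen for illustration, since the question does not say how large a batch is.

```python
INDEX_SIZE = 10_000_000  # documents in the index, from the question


def docs_per_day(batches_per_minute: int, docs_per_batch: int) -> int:
    """Documents written to the index per day at a constant update rate."""
    return batches_per_minute * docs_per_batch * 60 * 24


# Reading 1: 40 single-document updates per minute.
light = docs_per_day(batches_per_minute=40, docs_per_batch=1)

# Reading 2: 40 batches per minute; 500 documents per batch is a made-up value.
heavy = docs_per_day(batches_per_minute=40, docs_per_batch=500)

if __name__ == "__main__":
    print(f"reading 1: {light:,} docs/day ({light / INDEX_SIZE:.2%} of the index)")
    print(f"reading 2: {heavy:,} docs/day ({heavy / INDEX_SIZE:.2f}x the index)")
```

With the first reading the daily churn is well under 1% of the index. With the hypothetical second reading the whole index would be rewritten several times a day, which is a very different workload.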

    3) We are considering several frameworks on top of Lucene, amongst them Solr and Zoie.

    Solr runs as a Tomcat application. There you have to define, for example, how much RAM (see above) is assigned to your search engine.
    There are different ways to split your index (for better performance or faster updates), and clustering is also possible.
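To make the idea of splitting an index concrete, here is a minimal, generic sketch of hash-based sharding: each document is routed to one shard by a stable hash of its ID, and a query is answered by merging the best hits from every shard. This is an illustration of the general technique, not Solr's or Zoie's actual implementation; `shard_for` and `merge_top_k` are hypothetical helper names.

```python
import heapq
import zlib


def shard_for(doc_id: str, num_shards: int) -> int:
    """Pick a shard for a document using a stable hash of its ID."""
    # zlib.crc32 is deterministic across processes, unlike the built-in hash().
    return zlib.crc32(doc_id.encode("utf-8")) % num_shards


def merge_top_k(per_shard_hits: list[list[tuple[float, str]]], k: int) -> list[tuple[float, str]]:
    """Merge (score, doc_id) hits from all shards and keep the k best."""
    all_hits = (hit for hits in per_shard_hits for hit in hits)
    return heapq.nlargest(k, all_hits)


if __name__ == "__main__":
    print(shard_for("12345", num_shards=4))
    shard_results = [[(0.9, "a"), (0.2, "b")], [(0.5, "c")]]
    print(merge_top_k(shard_results, k=2))
```

Each shard holds only a fraction of the documents, so a single query touches less data per machine, at the cost of an extra merge step and more moving parts to operate.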

© 2021 The Archive Base. All Rights Reserved