The Archive Base


Editorial Team
Asked: June 12, 2026

Consider a worker process which searches the web for particular data. Another process is


Consider a worker process that searches the web for particular data. A second process is needed to index the results of the first for later use. Indexing involves writing the raw data (the search results) to a huge distributed HBase repository in a particular way. I can’t predict how fast the two processes will run relative to each other. Either one may also be temporarily down, in which case it needs to pick up its work where it left off when it comes back up. I’m using Java EE. This is how I currently plan to implement it:

  1. The first process stores its search results in a MySQL database and sends a message containing the IDs of the new rows it has inserted into the table.
  2. The MOM (message-oriented middleware) wakes up the second process, which consumes the new raw data stored in the MySQL database.
  3. The second process clears the MySQL table once it has finished indexing the data in the real database (HBase).

I would like expert comments on whether this design is appropriate. For instance, what if the second process instead polled the table continuously to check for new records? Am I using the right technology, or is it overkill? Should I simplify the design, or am I missing something? If the solution is appropriate, is there anything I should keep in mind during implementation? Thanks in advance.


1 Answer

    Editorial Team
    Added an answer on June 12, 2026 at 4:48 pm

    I would stick to a simpler design if possible: ditch the MySQL staging table and rely on JMS alone.

    Something like this would do it:

    1. [P1] sends the search result to a JMS queue, “INDEX.QUEUE”.
    2. [P2] simply consumes messages from “INDEX.QUEUE” asynchronously and generates search indices from the search result in each message payload.
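The two steps above can be sketched in plain Java. This is not JMS code: it uses a `java.util.concurrent.BlockingQueue` as an in-process stand-in for “INDEX.QUEUE” so that the hand-off can be run without a broker. The class name `IndexQueueSketch`, the `STOP` marker, and the `"indexed:"` prefix are all hypothetical, and the real indexing step would write to HBase instead of a list.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

public class IndexQueueSketch {
    // Hypothetical end-of-stream marker; a real JMS consumer would simply keep listening.
    static final String STOP = "__STOP__";

    // Runs P1 (producer) and P2 (consumer) against an in-memory queue
    // and returns what P2 "indexed".
    static List<String> run(List<String> searchResults) throws InterruptedException {
        // Stand-in for the JMS queue "INDEX.QUEUE" (assumption: in-memory, not persistent).
        BlockingQueue<String> indexQueue = new LinkedBlockingQueue<>();
        List<String> indexed = new ArrayList<>();

        // [P2] consumes asynchronously: it blocks on take() until a message arrives.
        Thread p2 = new Thread(() -> {
            try {
                while (true) {
                    String payload = indexQueue.take();
                    if (payload.equals(STOP)) {
                        break;
                    }
                    // Placeholder for the real indexing work (writing to HBase).
                    indexed.add("indexed:" + payload);
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        p2.start();

        // [P1] sends each search result as the message payload; no staging table involved.
        for (String result : searchResults) {
            indexQueue.put(result);
        }
        indexQueue.put(STOP);

        p2.join(); // join() makes P2's writes to 'indexed' visible to this thread
        return indexed;
    }

    public static void main(String[] args) throws InterruptedException {
        System.out.println(run(Arrays.asList("result-1", "result-2")));
        // prints [indexed:result-1, indexed:result-2]
    }
}
```

The producer never needs to know how fast the consumer is; the queue absorbs the difference. With a real JMS provider the queue would also survive restarts of either process, which an in-memory queue does not.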

    Messaging exists to help with exactly this kind of task. Polling a database table achieves much the same result but is trickier to get right, so why reinvent the wheel when a persistent, transactional MOM designed for the job is available?
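To illustrate what "transactional" buys here, the following is a minimal in-memory model, not JMS code. A message is removed from the queue only after indexing succeeds, so a failed write leaves it in place for another attempt. A JMS provider gives this behaviour through transacted sessions or container-managed transactions; the names `RedeliverySketch`, `Indexer`, and `drain` are hypothetical.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Deque;
import java.util.List;

public class RedeliverySketch {
    // Hypothetical callback standing in for the HBase indexing step.
    interface Indexer {
        void index(String payload) throws Exception;
    }

    // Processes messages from the head of the queue and returns the number
    // of delivery attempts made. maxAttempts guards against endless retries.
    static int drain(Deque<String> queue, Indexer indexer, int maxAttempts) {
        int attempts = 0;
        while (!queue.isEmpty() && attempts < maxAttempts) {
            attempts++;
            String payload = queue.peekFirst(); // look at the message without removing it
            try {
                indexer.index(payload);
                queue.pollFirst();              // "commit": remove only after success
            } catch (Exception e) {
                // "rollback": the message stays at the head and is delivered again
            }
        }
        return attempts;
    }

    public static void main(String[] args) {
        Deque<String> queue = new ArrayDeque<>(Arrays.asList("result-1", "result-2"));
        List<String> indexed = new ArrayList<>();
        boolean[] outage = {true}; // simulate a single failed write

        int attempts = drain(queue, payload -> {
            if (outage[0]) {
                outage[0] = false;
                throw new Exception("store unavailable");
            }
            indexed.add(payload);
        }, 10);

        System.out.println(attempts + " attempts, indexed " + indexed);
        // prints 3 attempts, indexed [result-1, result-2]
    }
}
```

Getting the same guarantee from a polled staging table means hand-writing the claim, retry, and delete logic, which is the part the answer calls trickier.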



© 2021 The Archive Base. All Rights Reserved