Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 8952257
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: June 15, 20262026-06-15T13:50:12+00:00 2026-06-15T13:50:12+00:00

From my understanding rows are inserted into HBase tables and are getting stored as

  • 0

From my understanding rows are inserted into HBase tables and are getting stored as regions in different region server. So, the region server stores the data

Similarly in terms of Hadoop, data is stored in the data nodes present in the hadoop cluster.

Lets say that i have HBase 0.90.6 configured on top of Hadoop 1.1.1 as follows

2 nodes – master and slave

  1. Master node acts as,
    • Hadoop – Namenode, Secondary Namenode, job tracker, data node, task tracker
    • HBase – Master, RegionServer and zookeeper.
  2. Slave node acts as,
    • Hadoop datanode and task tracker
    • HBase region server

Based on my statement if table data is stored in the region servers; then what is the role of the data nodes and region servers?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-06-15T13:50:13+00:00Added an answer on June 15, 2026 at 1:50 pm

    Data nodes store data. Region server(s) essentially buffer I/O operations; data is permanently stored on HDFS (that is, data nodes). I do not think that putting region server on your ‘master’ node is a good idea.

    Here is a simplified picture of how regions are managed:

    You have a cluster running HDFS (NameNode + DataNodes) with replication factor of 3 (each HDFS block is copied into 3 different DataNodes).

    You run RegionServers on the same servers as DataNodes. When write request comes to RegionServer it first writes changes into memory and commit log; then at some point it decides that it is time to write changes to permanent storage on HDFS. Here is were data locality comes into play: since you run RegionServer and DataNode on the same server, first HDFS block replica of the file will be written to the same server. Two other replicas will be written to, well, other DataNodes. As a result RegionServer serving the region will almost always have access to local copy of data.

    What if RegionServer crashes or RegionMaster decided to reassign region to another RegionServer (to keep cluster balanced)? New RegionServer will be forced to perform remote read first, but as soon as compaction is performed (merging of change log into the data) – new file will be written to HDFS by the new RegionServer, and local copy will be created on the RegionServer (again, because DataNode and RegionServer runs on the same server).

    Note: in case of RegionServer crash, regions previously assigned to it will be reassigned to multiple RegionServers.

    Good reads:

    • Tom White, “Hadoop, The Definitive Guide” has good explanation of HDFS architecture. Unfortunately I did not read original Google GFS paper, so I cannot tell if it is easy to follow.

    • Google BigTable article. HBase is implementation of Google BigTable, and I found that architecture description in this article is the easiest to follow.

    Here is nomenclature differences between Google Bigtable and HBase implementation (from Lars George, “HBase, The Definitive Guide”):

    • HBase – Bigtable
    • Region – Tablet
    • RegionServer – Tablet server
    • Flush – Minor compaction
    • Minor compaction – Merging compaction
    • Major compaction – Major compaction
    • Write ahead log – Commit log
    • HDFS – GFS
    • Hadoop MapReduce – MapReduce
    • MemStore – memtable
    • HFile – SSTable
    • Zookeeper – Chubby
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

From what little understanding of Cassandra I have, it seems that data locality is
My understanding is that OneToOneField is used for just 1 row of data from
I have an SSIS package set up to export data from a SQL Server
I need some help in understanding my error. I want to read data from
From my understanding of the manual for DECIMAL in the mysql docs, it states
From my understanding, the soft keyboard is actually a dialog window that underlies all
From my understanding, each of these methods: get() and put() are atomic. But, when
From my understanding of the docs this general approach should work: begin try1 rescue
From my understanding by reading several articles I assumed Process Address Space(PAS) and Virtual
MariaDB 5.3 introduced dynamic columns. From my understanding the next version of mysql should

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.