The Archive Base

Editorial Team
Asked: June 12, 2026

We have a 6 node cassandra cluster with a very large number of reads

We have a 6-node Cassandra cluster with a very large number of reads per second and very few writes. The whole application comprises:

  • a web app server that uses one Cassandra node
  • 5 web service machines, each using its own Cassandra node (pycassa’s Pool server_list always contains exactly one node)

The web app does both reads and writes against Cassandra, but very few of them: only when somebody actually uses the app UI, which does not happen often. The web service, however, is very heavily loaded with traffic from a 3rd-party service. A load balancer directs the traffic to all 5 servers, and each server bombards its own Cassandra node (which physically runs on yet another server) with lots of get() and multiget() requests. Once in a while a set() is used, but that is roughly once per 10 thousand reads.

Given this usage pattern, we decided to use a replication factor of 6. If each Cassandra node holds 100% of the data, reads should be faster and the load should be balanced more evenly. We updated the keyspace strategy_options and ran nodetool repair on each node to transfer the data. It went OK.
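The claim that a replication factor of 6 on a 6-node ring puts 100% of the data on every node can be checked with a small sketch. This is a simplified model of ring-order replica placement, not Cassandra's actual code; the token space of 0..599, the evenly spaced tokens, and the node names are assumptions made up for illustration.

```python
from bisect import bisect_left


def replicas_for(token, ring, rf):
    """Return the names of the rf nodes that hold a key with this token.

    ring is a list of (node_token, node_name) pairs sorted by token. The walk
    starts at the first node whose token is >= the key's token and continues
    clockwise around the ring (a simplified placement model).
    """
    tokens = [t for t, _ in ring]
    start = bisect_left(tokens, token) % len(ring)
    count = min(rf, len(ring))
    return [ring[(start + i) % len(ring)][1] for i in range(count)]


def ownership(ring, rf, token_space=600):
    """Fraction of the token space for which each node holds a replica."""
    held = {name: 0 for _, name in ring}
    for token in range(token_space):
        for name in replicas_for(token, ring, rf):
            held[name] += 1
    return {name: n / float(token_space) for name, n in held.items()}


# Hypothetical ring: six nodes with evenly spaced tokens.
RING = [(i * 100, "node%d" % (i + 1)) for i in range(6)]

print(ownership(RING, rf=3))  # every node holds a replica of half the data
print(ownership(RING, rf=6))  # every node holds a replica of all the data
```

Under these assumptions every key has all six nodes as replicas, so any node can answer a read from its own local copy. Whether it is the only node that does work for that read is a separate matter.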

Now the very strange thing: all six Cassandra nodes show very high CPU usage. That is understandable for the five nodes used by the web service, but we can’t explain why the web app’s Cassandra node is also consuming that much CPU, as if it were serving a lot of reads. It’s as if the replication didn’t work at all: it looks like each Cassandra node talks to all the other nodes whenever a get() happens, and the whole ring is extremely stressed.

To test this, I ran another experiment: I took down one of the web servers and watched the corresponding Cassandra node. With the server down, I expected CPU usage on that Cassandra node to be near zero, because no other machine points to it. It wasn’t: it dropped slightly but stayed very high (60% CPU usage).

We’re using pycassa and we did not change the consistency level, so it is at the default, ConsistencyLevel.ONE.

I hope you see what I mean. If the replication factor equals the number of nodes in the ring and the read consistency level is the default (ONE), then each node should be more or less independent in terms of reads: if no one is reading from a given node, CPU usage on that node should be minimal, correct? However, even if we disconnect the only client that talks to the node, we still observe high CPU usage, as if someone were still reading from it. Where is this load coming from, and how can we investigate what’s going on?

1 Answer

  1. Editorial Team
    Added an answer on June 12, 2026 at 7:37 pm

    Correct me if I’m wrong, but I suspect the load you’re seeing on the nodes comes from read repair, which happens in the background. When you read at ConsistencyLevel.ONE, the data is returned as soon as one replica responds, but the read also triggers a read repair in the background: digest queries for the requested data are sent to all the other replicas to check consistency.
    Since the replication factor is 6 (all data on all nodes), every read sends read repair requests to all of the other nodes, so a node does work for every read in the cluster even when no client is connected to it.
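The fan-out described in this answer can be illustrated with a rough simulation. It is a sketch under stated assumptions, not a model of Cassandra internals: every node is treated as a replica for every key, the coordinator is assumed to serve the data read from its own copy, and the probability of a background digest check is a parameter named read_repair_chance after the column family setting of that name (its default depends on the Cassandra version). The node names, client names, and read counts are made up.

```python
import random


def simulate_reads(nodes, clients, reads_per_client,
                   read_repair_chance=1.0, seed=0):
    """Count replica-level read requests per node at ConsistencyLevel.ONE.

    clients maps a client name to the single node it connects to, which acts
    as coordinator for all of that client's reads.
    """
    rng = random.Random(seed)
    load = {node: 0 for node in nodes}
    for coordinator in clients.values():
        for _ in range(reads_per_client):
            load[coordinator] += 1  # data read served by the coordinator
            if rng.random() < read_repair_chance:
                for node in nodes:
                    if node != coordinator:
                        load[node] += 1  # background digest read
    return load


NODES = ["node%d" % i for i in range(1, 7)]
# node1 belongs to the rarely used web app, so it has no client in this model.
CLIENTS = {"ws%d" % i: "node%d" % (i + 1) for i in range(1, 6)}

with_repair = simulate_reads(NODES, CLIENTS, 1000)
without_repair = simulate_reads(NODES, CLIENTS, 1000, read_repair_chance=0.0)
# Repeat the experiment from the question: take down the client of node2.
remaining = {c: n for c, n in CLIENTS.items() if c != "ws1"}
one_down = simulate_reads(NODES, remaining, 1000)

print(with_repair["node1"], without_repair["node1"])  # 5000 0
print(with_repair["node2"], one_down["node2"])        # 5000 4000
```

In this model the web app's node handles as many replica reads as the others when read repair runs on every read, and disconnecting one client lowers the load on its node by only about a fifth, which is consistent with the "dropped slightly" observation in the question. If this is indeed the cause, lowering read_repair_chance on the column family would be the setting to experiment with.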

    Cassandra read repair

© 2021 The Archive Base. All Rights Reserved