Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6607393
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 25, 20262026-05-25T19:29:55+00:00 2026-05-25T19:29:55+00:00

Is there anyone who got a chance to work on both? I need to

  • 0

Is there anyone who got a chance to work on both? I need to set up a framework to move data around. Basically, we have clickstream data coming in as text files. This data needs to be moved around form the app-servers to HDFS, and then to S3 after archival.

I need help in choosing between Flume and Scribe. Which one is better in terms of manageability, setting up and which is easier to customize?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-25T19:29:55+00:00Added an answer on May 25, 2026 at 7:29 pm

    View the answer posted here

    I’ll quote the answer:

    1. Flume allows you to configure your Flume installation from a
      central point, without having to ssh into every machine, update a
      configuration variable and restart a daemon or two. You can start,
      stop, create, delete and reconfigure logical nodes on any machine
      running Flume from any command line in your network with the Flume
      jar available.

    2. Flume also has centralised liveness monitoring. We’ve heard a
      couple of stories of Scribe processes silently failing, but lying
      undiscovered for days until the rest of the Scribe installation
      starts creaking under the increased load. Flume allows you to see the
      health of all your logical nodes in one place (note that this is
      different from machine liveness monitoring; often the machine stays
      up while the process might fail).

    3. Flume supports three distinct types of reliability guarantees,
      allowing you to make tradeoffs between resource usage and
      reliability. In particular, Flume supports fully ACKed reliability,
      with the guarantee that all events will eventually make their way
      through the event flow.

    4. Flume’s also really extensible – it’s really easy to write your own
      source or sink and integrate most any system with Flume. If rolling
      your own is impractical, it’s often very straightforward to have your
      applications output events in a form that Flume can understand (Flume
      can run Unix processes, for example, so if you can use shell script
      to get at your data, you’re golden).

    This isn’t an exhaustive list of benefits to using Flume – I haven’t
    touched on using decorators for lightweight transformation or
    metadata extraction, the configuration language, the ability to run
    several logical nodes in a single Flume process, automatic bucketing
    and rolling of log files in HDFS… there’s lots more about Flume
    that we’re looking forward to sharing with everyone.

    The key difference to me is that Cloudera is actively supporting
    Flume. While I do generally trust Facebook to maintain great open
    source projects, Cloudera’s business is built around providing support
    for tools like this, so I have faith that Flume will longterm be
    better supported. I want to minimize the time I have to think about
    this particular problem. That said, so far I’ve had a lot of annoying
    issues where Flume was either a bit convoluted in its abstraction or
    buggy in its implementation, as you might expect from a pre-1.0
    technology. If Asana weren’t still in beta, I’d probably have chosen
    Scribe

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

Is there anyone who have encountered Processing Dirty Regions error in MyEclipse? Actually everytime
Is there anyone who knows this? I have been trying this for the last
Is there anyone who knows how to destroy a javascript (jquery) function? I'm using
Is there anyone who has already tried to use the Microsoft Bing translator web
Is there anyone out there who knows if it's possible to use common web
Anyone out there who knows what are the differences between BasicRenderEngine and LazyRenderEngine?
Anyone who's tried to study mathematics using online resources will have come across these
Is there any one who can point me to a Rails 3.2 multilingual starter
Can anyone who has used three.js tell me if its possible to detect webgl
Is there anyone knows about list of web servers which is used in embedded

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.