Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6966143
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 27, 20262026-05-27T16:09:13+00:00 2026-05-27T16:09:13+00:00

I was just going through the word count example in MapReduce. The map function

  • 0

I was just going through the word count example in MapReduce. The map function is very straightforward. Is there a higher level function that decides what part of the file go to what mapper?
Suppose you are relying on a function (such as SHA1) that relies on the input of the entire file, is there any to tell the framework not to split files?

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-27T16:09:14+00:00Added an answer on May 27, 2026 at 4:09 pm

    Is there a higher level function that decides what part of the file go to what mapper?

    When a map slot is free on a node, the scheduler picks a split which is nearest to the node to avoid data transfer as much as possible. If an unprocessed input split is on the same node as the free map slot then that split is processed, if not then a split in the same rack is chosen or else a split outside the rack is chosen.

    is there any to tell the framework not to split files?

    Implement the FileInputFormat#isSplitable(). Then the input files are not split and are processed one per map.

    import org.apache.hadoop.fs.*;
    import org.apache.hadoop.mapred.TextInputFormat;
    public class NonSplittableTextInputFormat extends TextInputFormat {
        @Override
        protected boolean isSplitable(FileSystem fs, Path file) {
            return false;
        }
    }
    
    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm just going through perlxstut and I found there newSVnv in EXAMPLE 5 and
I was just going through the following URL, and i noticed that when i
Just going through the sample Scala code on Scala website, but encountered an annoying
I just downloaded MVC and I am going through a tutorial. Everything goes fine
I am going through the various Google AppEngine tutorials sometimes, and I just noticed
I am just going through the WPF concepts I came across these routed events
I was just going through some codes of C++. Where in I came across
I was just going through Design Patterns Questions on Stackoverflow and everyone mentioned about
I was just going through programming structure studies; In particular I was studying concurrent
i was just going through the documentation given on developer.android.com and when i was

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.