This is my first time using map/reduce. I want to write a program that

Question

0

Asked: May 25, 20262026-05-25T23:19:23+00:00 2026-05-25T23:19:23+00:00

This is my first time using map/reduce. I want to write a program that

0

This is my first time using map/reduce. I want to write a program that processes a large log file. For example, if I was processing a log file that had records consisting of {Student, College, and GPA}, and wanted to sort all students by college, what would be the ‘map’ part and what would be the ‘reduce’ part? I am having some difficulty with the concept, despite having gone over a number of tutorials and examples.

Thanks!

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-05-25T23:19:24+00:00

Technically speaking, Hadoop MapReduce treats everything as key-value pairs; you just need to define what the keys are and what the values are. The signatures of map and reduce are

map: (K1 x V1) -> (K2 x V2) list
reduce: (K2 x V2) list -> (K3 x V3) list

with sorting taking place on K2 values in the intermediate shuffle phase between map and reduce.

If your inputs are of the form

Student x (College x GPA)

Then your mapper should do nothing more than get the College values to the key:

map: (s, c, g) -> [(c, s, g)]

with college as the new key, Hadoop will sort by college for you. Your reducer then, is just a plain old “identity reducer.”

If you are carrying out a sorting operation in practice (that is, this isn’t a homework problem), then check out Hive, or Pig. These systems drastically simplify these kinds of tasks. Sorting on a particular column becomes quite trivial. However, it is always educational to write, say, a hadoop streaming job for tasks like the one you identified here, to give you a better understanding of mappers and reducers.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

This is my first time using map/reduce. I want to write a program that

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply