I started learning Hadoop, and am a bit confused by MapReduce. For tasks where

Question

0

Asked: June 12, 20262026-06-12T14:39:57+00:00 2026-06-12T14:39:57+00:00

I started learning Hadoop, and am a bit confused by MapReduce. For tasks where

0

I started learning Hadoop, and am a bit confused by MapReduce. For tasks where result natively is a list of key-value pairs everything seems clear. But I don’t understand how should I solve the tasks where result is a single value (say, sum of squared input decimals, or centre of mass for input points).

On the one hand I can put all results of mapper to the same key. But as far as I understood in this case the only reducer will manage the whole set of data (calculate sum, or mean coordinates). It doesn’t look like a good solution.

Another one that I can imaging is to group mapper results. Say, mapper that processed examples 0-999 will produce key equals to 0, 1000-1999 will produce key equals to 1, and so on. As far as there still will be multiple results of reducers, it will be necessary to build chain of reducers (reducing will be repeated until only one result remains). It looks much more computational effective, but a bit complicated.

I still hope that Hadoop has the off-the-shelf tool that executes superposition of reducers to maximise the efficiency of reducing the whole data to a single value. Although I failed to find one.

What is the best practise of solving the tasks where result is a single value?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-12T14:39:58+00:00

Editorial Team

2026-06-12T14:39:58+00:00Added an answer on June 12, 2026 at 2:39 pm

If you are able to reformulate your task in terms of commutative reduce you should look at Combiners. Any way you should take a look on it, it can significantly reduce amount data to shuffle.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I started learning Hadoop, and am a bit confused by MapReduce. For tasks where

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply