I’m rewriting a MongoDB map reduce job to use Hadoop instead (using the mongo-hadoop

Question

0

Asked: June 9, 20262026-06-09T13:38:56+00:00 2026-06-09T13:38:56+00:00

I’m rewriting a MongoDB map reduce job to use Hadoop instead (using the mongo-hadoop

0

I’m rewriting a MongoDB map reduce job to use Hadoop instead (using the mongo-hadoop connector), but when I map two datasets to the same collection, it overwrites the values instead of using them

{ reduce : “collectionName” } – If documents exists for a given key in the result set and in the old collection, then a reduce operation (using the specified reduce function) will be performed on the two values and the result will be written to the output collection. If a finalize function was provided, this will be run after the reduce as well.

How is done using mongo-hadoop?

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-09T13:38:57+00:00

Editorial Team

2026-06-09T13:38:57+00:00Added an answer on June 9, 2026 at 1:38 pm

To anyone else looking for this, support for multiple input is coming soon.

The branch with the change is located here. It’s pretty well done, we’re using it in production.

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m rewriting a MongoDB map reduce job to use Hadoop instead (using the mongo-hadoop

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply