I am using Hadoop map-reduce program, where I want to represent part of the

Question

0

Asked: June 18, 20262026-06-18T09:26:18+00:00 2026-06-18T09:26:18+00:00

I am using Hadoop map-reduce program, where I want to represent part of the

0

I am using Hadoop map-reduce program, where I want to represent part of the file as key. This I want to use to do for some analytics. However I found this has brought the performance. Can anyone please tell if there are any alternative to using large chunk of text. Can we encode it in any other format. I have also found by converting strings to byte or binary format. But still I am not able to store it in integer datatype. I tried converting it to BigInteger but in vain, since there are also collisions happening when reducing the text which are not similar. How to represent large chunk of text as key in mapper other than using Text datatype.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-18T09:26:19+00:00

Editorial Team

2026-06-18T09:26:19+00:00Added an answer on June 18, 2026 at 9:26 am

How long can the part of your file be? How similar are the keys to each other? Have you considered using the MD5 hash (or similar) of the text as the key in your mapper?

0

Reply
Share
Share

- Report

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am using Hadoop map-reduce program, where I want to represent part of the

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply