I am learning to work on hadoop cluster. I have worked for some time

Question

0

Asked: June 3, 20262026-06-03T13:39:21+00:00 2026-06-03T13:39:21+00:00

I am learning to work on hadoop cluster. I have worked for some time

0

I am learning to work on hadoop cluster. I have worked for some time on hadoop streaming where I coded map-reduce scripts in perl/python and ran the job.
However, I didn’t find any good explanation for running a java map reduce job.
For example:
I have the following program-

http://www.infosci.cornell.edu/hadoop/wordcount.html

Can somebody tell me how shall I actually compile this program and run the job.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-03T13:39:23+00:00

Create a directory to hold the compiled class:

mkdir WordCount_classes

Compile your class:

javac -classpath ${HADOOP_HOME}/hadoop-${HADOOP_VERSION}-core.jar -d WordCount_classes WordCount.java

Create a jar file from your compiled class:

jar -cvf $HOME/code/hadoop/WordCount.jar -C WordCount_classes/ .

Create a directory for your input and copy all your input files into it, then run your job as follows:

bin/hadoop jar $HOME/code/WordCount.jar WordCount ${INPUTDIR} ${OUTPUTDIR}

The output of your job will be put in the ${OUTPUTDIR} directory. This directory is created by the Hadoop job, so make sure it doesn’t exist before you run the job.

See here for a full example.

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I am learning to work on hadoop cluster. I have worked for some time

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply