Let’s say I’m using cleanup() functions in Hadoop MapReduce. How would I add a progress tracking mechanism inside it, let’s say in percentage complete, to display it in console?
Share
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
There is no implementation of the cleanup() of the Mapper.java class.
When a Hadoop job is run from the command prompt, the following is printed on the console.
11/10/31 18:15:50 INFO mapreduce.Job: map 0% reduce 0%
11/10/31 18:16:15 INFO mapreduce.Job: map 50% reduce 0%
11/10/31 18:16:21 INFO mapreduce.Job: map 100% reduce 0%
11/10/31 18:16:30 INFO mapreduce.Job: map 100% reduce 100%
The code for the above is in the Job.java class.
Mapper.cleanup() code has to be modified to print the progress to the console and jar file built. I don’t think there is OOB support for the cleanup in Hadoop.