I’m starting with Hadoop framework, my task is to write map-reduce application for the

Question

0

Asked: June 13, 20262026-06-13T23:09:25+00:00 2026-06-13T23:09:25+00:00

I’m starting with Hadoop framework, my task is to write map-reduce application for the

0

I’m starting with Hadoop framework, my task is to write map-reduce application for the framework and and submit it. I have to use version 0.22.0 of Hadoop. I’m just learning basic concepts and API. However I find it very hard to learn it and to program some prototypes because both the official documentation and API javadocs are outdated, incomplete, generally chaotic and even non-existing.

Here are just few thinks that I do not understand: The MapReduce tutorial for Hadoop 0.22.0 uses constructor (here, line 101) of class Job that is deprecated. All other constructors are also deprecated. There is no note in the javadocs about what is to be used instead. There are static methods of class Job that return instance of Job but those methods are undocumented and they require instance of poorly documented class Cluster as parameter. So after reading all that mess I still don’t know how to properly get instance of Job. Any help on this is appreciated.

When I tried to find out the answer in tutorial to later versions like 1.0.4 stable I found out that mapreduce tutorial for that version uses all the classes from package org.apache.hadoop.mapred that are deprecated in version 0.22.0. So 0.22.0 is more resent then 1.0.4. Please help me understand this. Or suggest some better resources.

Report

Leave an answer
Cancel reply

You must login to add an answer.

Need An Account,

1 Answer

Editorial Team · Answer 1 · 2026-06-13T23:09:26+00:00

The Javadoc might be a bit confusing, therefore having a look at the source of the Job class will probably help you:

  ...
  @Deprecated
  public Job() throws IOException {
    this(new Configuration());
  }

  @Deprecated
  public Job(Configuration conf) throws IOException {
    this(new Cluster(conf), conf);
  }

  @Deprecated
  public Job(Configuration conf, String jobName) throws IOException {
    this(conf);
    setJobName(jobName);
  }

  Job(Cluster cluster) throws IOException {
    this(cluster, new Configuration());
  }

  Job(Cluster cluster, Configuration conf) throws IOException {
    super(conf, null);
    this.cluster = cluster;
  }

  ...
  public static Job getInstance(Cluster cluster, Configuration conf) 
      throws IOException {
    return new Job(cluster, conf);
  }

So you can use:

...
Configuration conf = getConf();
Job job = Job.getInstance(new Cluster(conf), conf);

Note that instantiating the Job class in this way will create at the same time a connection to the job tracker as well.

If you want to defer doing so, you have the option to lazily initialize this connection by setting Cluster to null when creating the Job object. In this case you will let the Job class to make the connection when it’s really needed (see further information here) :

Job job = Job.getInstance(conf);

Sign Up

Sign In

Forgot Password

The Archive Base Latest Questions

I’m starting with Hadoop framework, my task is to write map-reduce application for the

Leave an answerCancel reply

1 Answer

Leave an answer
Cancel reply