To effectively utilise map-reduce jobs in Hadoop, I need data to be stored

Asked: May 20, 2026

To use map-reduce jobs effectively in Hadoop, I need my data to be stored in Hadoop’s sequence file format. However, the data is currently only in flat .txt format. Can anyone suggest a way to convert a .txt file to a sequence file?

1 Answer

Editorial Team · Answer 1 · 2026-05-20T17:42:09+00:00

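The simplest answer is an “identity” job that writes its output as a SequenceFile: the mapper passes each record through unchanged, and only the output format differs from the input format.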

In Java it looks like this:

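    import java.io.IOException;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.SequenceFileOutputFormat;

    // the class name is arbitrary
    public class ConvertText {

        public static void main(String[] args) throws IOException,
                InterruptedException, ClassNotFoundException {

            Configuration conf = new Configuration();
            // Job.getInstance replaces the deprecated "new Job(conf)" constructor
            Job job = Job.getInstance(conf, "Convert Text");
            job.setJarByClass(Mapper.class);

            // the base Mapper and Reducer pass every record through unchanged
            job.setMapperClass(Mapper.class);
            job.setReducerClass(Reducer.class);

            // 0 = map-only job; increase if you need sorting or a special number of files
            job.setNumReduceTasks(0);

            // TextInputFormat emits (byte offset, line) pairs
            job.setOutputKeyClass(LongWritable.class);
            job.setOutputValueClass(Text.class);

            job.setInputFormatClass(TextInputFormat.class);
            job.setOutputFormatClass(SequenceFileOutputFormat.class);

            TextInputFormat.addInputPath(job, new Path("/lol"));
            SequenceFileOutputFormat.setOutputPath(job, new Path("/lolz"));

            // submit, wait for completion, and report failure through the exit code
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }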
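It may help to see what ends up in the sequence file. With TextInputFormat as the input format, each record's key is the byte offset at which a line starts and its value is the line's text, which is why the job declares LongWritable keys and Text values. The sketch below only mimics that pairing in plain Java with no Hadoop dependency. The class `OffsetRecords` and the method `toRecords` are hypothetical names made up for illustration, and the sketch assumes UTF-8 text with `\n` line endings.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class OffsetRecords {

    // Hypothetical helper, not a Hadoop API: splits text on '\n' and keys
    // each line by the UTF-8 byte offset at which that line starts.
    public static Map<Long, String> toRecords(String text) {
        Map<Long, String> records = new LinkedHashMap<>();
        long offset = 0;
        int start = 0;
        while (start < text.length()) {
            int end = text.indexOf('\n', start);
            boolean hasNewline = end >= 0;
            if (!hasNewline) {
                end = text.length(); // last line without a trailing newline
            }
            String line = text.substring(start, end);
            records.put(offset, line);
            // advance by the line's byte length plus one byte for the newline
            offset += line.getBytes(StandardCharsets.UTF_8).length + (hasNewline ? 1 : 0);
            start = end + 1;
        }
        return records;
    }

    public static void main(String[] args) {
        Map<Long, String> records = toRecords("first line\nsecond line\nthird\n");
        for (Map.Entry<Long, String> e : records.entrySet()) {
            System.out.println(e.getKey() + "\t" + e.getValue());
        }
    }
}
```

For the three-line sample in `main`, the keys come out as 0, 11 and 23. If the offsets are of no use downstream, a custom mapper that emits a different key would be needed in place of the identity mapper.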
