How can I identify the input format in a MapReduce program?

77 Views Asked by Harsh At 05 June 2025 at 20:12

I just started learning Hadoop, and there are various input formats. I have a few programs to study, and my main question is: how can I tell whether a program's input format is TextInputFormat, KeyValueTextInputFormat, or something else? Your help is really appreciated.


There is 1 best solution below

philantrovert On 08 September 2017 at 06:37

You don't have to identify which InputFormat is being used by the MapReduce program.

The InputFormat is something you specify explicitly in your program, and the MapReduce job uses whatever you set.

If you don't specify anything, the job uses the default, TextInputFormat, which extends FileInputFormat<LongWritable, Text>. That's why, in a simple word-count program, you often see the Mapper class defined as:

public class MyMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    // input key: byte offset of the line (LongWritable); input value: the line itself (Text)
    //...
}
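To make the difference between the two formats from the question more concrete, here is a minimal sketch in plain Java. It does not use Hadoop at all; the class RecordShapeSketch and its methods are hypothetical names that only mimic the record shapes: TextInputFormat hands the mapper (byte offset, line), while KeyValueTextInputFormat splits each line at the first tab (its default separator) into (key, value). In practice this means the Mapper's input key type is a useful hint: LongWritable suggests TextInputFormat, Text suggests KeyValueTextInputFormat.

```java
import java.nio.charset.StandardCharsets;
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: plain Java, not the Hadoop classes.
final class RecordShapeSketch {

    // Mimics TextInputFormat: key = byte offset of the line start, value = the whole line.
    static List<Map.Entry<Long, String>> asTextRecords(String data) {
        List<Map.Entry<Long, String>> records = new ArrayList<>();
        long offset = 0;
        for (String line : data.split("\n")) {
            records.add(new SimpleEntry<>(offset, line));
            // +1 accounts for the newline byte that ended this line
            offset += line.getBytes(StandardCharsets.UTF_8).length + 1;
        }
        return records;
    }

    // Mimics KeyValueTextInputFormat: split each line at the first tab.
    // A line without a tab becomes the key, with an empty value.
    static List<Map.Entry<String, String>> asKeyValueRecords(String data) {
        List<Map.Entry<String, String>> records = new ArrayList<>();
        for (String line : data.split("\n")) {
            int tab = line.indexOf('\t');
            String key = tab < 0 ? line : line.substring(0, tab);
            String value = tab < 0 ? "" : line.substring(tab + 1);
            records.add(new SimpleEntry<>(key, value));
        }
        return records;
    }

    public static void main(String[] args) {
        String data = "k1\tv1\nk2\tv2\n";
        System.out.println(asTextRecords(data));
        System.out.println(asKeyValueRecords(data));
    }
}
```

The real classes work on input splits and Writable types, so treat this only as a picture of what the mapper receives.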

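You can specify the InputFormat on the JobConf object (this is the older org.apache.hadoop.mapred API):

// JobConf and the SequenceFile formats below come from org.apache.hadoop.mapred
JobConf job = new JobConf(new Configuration(), MyJob.class);

// Override the default TextInputFormat / TextOutputFormat
job.setInputFormat(SequenceFileInputFormat.class);
job.setOutputFormat(SequenceFileOutputFormat.class);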
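So when you read someone else's job, the practical check is: look for a setInputFormat call in the driver, and if there is none, assume TextInputFormat. The sketch below mimics that "explicit setting, else default" lookup in plain Java with java.util.Properties instead of a real JobConf. The class InputFormatLookupSketch is hypothetical, and the property name mapred.input.format.class is what I believe the old API stores the setting under; treat it as an assumption and verify it against your Hadoop version.

```java
import java.util.Properties;

// Hypothetical illustration only: java.util.Properties stands in for JobConf.
final class InputFormatLookupSketch {

    // Assumed property name for the old mapred API; check your Hadoop version.
    static final String KEY = "mapred.input.format.class";

    // The default the job falls back to when nothing is set.
    static final String DEFAULT = "org.apache.hadoop.mapred.TextInputFormat";

    // Returns the explicitly configured format, or the default if none was set.
    static String effectiveInputFormat(Properties conf) {
        return conf.getProperty(KEY, DEFAULT);
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Nothing set yet, so the default applies.
        System.out.println(effectiveInputFormat(conf));

        // Equivalent in spirit to job.setInputFormat(SequenceFileInputFormat.class).
        conf.setProperty(KEY, "org.apache.hadoop.mapred.SequenceFileInputFormat");
        System.out.println(effectiveInputFormat(conf));
    }
}
```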

See the InputFormat class documentation for further reading.
