How do I specify a custom RecordReader to use in job flow on Amazon EMR?
Note: Hadoop newbie here.
Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.
Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.
Lost your password? Please enter your email address. You will receive a link and will create a new password via email.
Please briefly explain why you feel this question should be reported.
Please briefly explain why you feel this answer should be reported.
Please briefly explain why you feel this user should be reported.
You specify input format in the your driver class (the class containtin “main” of the job. Record reader is logical part of Your input format which actually return it to the mapping task.
I do not think it is something specific to the EMR.