I am working with Hadoop in pseudo-distributed mode.
I write a MapReduce job, package it as a jar, copy the file to the Hadoop machine, and then use
./bin/hadoop jar *
to start it.
My question is: is there any other way to do it? If we have thousands of jobs to run, we can't type every command by hand. What is done in a production environment?
Thanks.
If you have thousands of jobs and there are no dependencies between them, write a shell script that submits them. If there are dependencies between the jobs, try Apache Oozie, as Chris mentioned.
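For the independent-jobs case, a minimal sketch of such a script might look like the following. The function name submit_all, the HADOOP_CMD variable, and the idea of keeping all job jars in one directory are assumptions made for illustration. It also assumes each jar's manifest names its main class and that the job needs no extra arguments.

```shell
#!/usr/bin/env bash
# Hypothetical helper: submit every jar in a directory, one after another.
# HADOOP_CMD is an assumed variable; point it at your own hadoop launcher.
HADOOP_CMD="${HADOOP_CMD:-./bin/hadoop}"

submit_all() {
  local job_dir="$1"
  local failed=0
  local jar
  for jar in "$job_dir"/*.jar; do
    # Skip the unexpanded pattern when the directory contains no jars.
    [ -e "$jar" ] || continue
    echo "Submitting $jar"
    # Assumes the jar's manifest declares the main class.
    if ! $HADOOP_CMD jar "$jar"; then
      echo "FAILED: $jar" >&2
      failed=$((failed + 1))
    fi
  done
  # Succeed only if every submission succeeded.
  [ "$failed" -eq 0 ]
}
```

You would call it as submit_all /path/to/jobs, for example from cron. The jobs run sequentially here; a failed job is reported on stderr and the loop continues with the next jar.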