How to import data from mysql to HDFS. I can’t use sqoop as it’s a HDFS installation not cloudera. I used below link to setup HDFS. My hadoop version is 0.20.2
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
How to import data from mysql to HDFS. I can’t use sqoop as it’s
Share
Apart from sqoop, you could try hiho. I have heard good things about it. (Never used it though)
But mostly what I have seen is people end up writing their own flows to do this. If hiho doesn;t work out, you can dump data from MySql using mysqlimport. Then load into HDFS using a map-reduce job or Pig/Hive.
I have heard Sqoop is pretty good and is widely used (This is hearsay again, I have never used it myself). Now that it is an apache incubator project, I think it might have started supporting apache releases of hadoop, or at least might have made it less painful for non-cloudera versions. The doc does say that it support Apache hadoop v0.21. Try to make it work with your hadoop version. It might not be that difficult.