I'm wondering to what extent Hadoop programming can be done in PHP or Ruby. I found articles talking about the Hadoop Streaming API, which can be used from PHP or Ruby.
My questions:
- Can you write a MapReduce job in PHP or Ruby that can work with other Hadoop Java MapReduce jobs?
- In terms of API-level programming, what is missing for non-Java languages in Hadoop? That is, what can be done only in Java right now and not in other languages?
Thanks
Personally, I also come from a Ruby/Python background, and at first tried using streaming to get things done. Eventually I decided to give the Java API a chance, and it turned out to be not too bad 🙂
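For anyone who wants to see what the streaming route looks like before deciding: a streaming job is just a pair of programs that read lines on stdin and write key/value lines (tab-separated by default) to stdout, and the reducer receives its input sorted by key. Here is a rough word-count sketch in Python; the same shape works in Ruby or PHP. The function names and the `map`/`reduce` command-line argument are my own convention for illustration, not part of Hadoop.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record per whitespace-separated word."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts for each key. Input must be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


# Hypothetical convention: the script is invoked with "map" or "reduce".
STEPS = {"map": map_lines, "reduce": reduce_lines}

if __name__ == "__main__" and len(sys.argv) == 2 and sys.argv[1] in STEPS:
    for record in STEPS[sys.argv[1]](sys.stdin):
        print(record)
```

Assuming the script is saved as `wc.py` (a made-up name), you can try it without a cluster by piping a file through the map step, then `sort`, then the reduce step. On a cluster, the same script would be passed to the streaming jar via its `-mapper` and `-reducer` options; the jar's location depends on your Hadoop version.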