Do you know of any python mapreduce ready clustering libraries?
I have found some good libraries in Java (http://lucene.apache.org/mahout/), I’d prefer to use python though.
https://github.com/klbostee/dumbo/wiki (Python mapreduce API )
Edit —
I’m looking for mapreduce ready : Canopy, K-means, Means-shift,etc..
You can use Python in combination with Hadoop, if you like:
http://www.michael-noll.com/wiki/Writing_An_Hadoop_MapReduce_Program_In_Python