I am learning python and django. I need to do extensive stuff for data parsing , stats , AI and data mining. There are many libraries available. so I want to know which SET I should learn. Currently I have the following SET in My Mind.
- Parsing: Beautiful Soup
- Search Engine: Whoosh (Will it crawl and index the website or not??)
- AI: PyBrain
- Data Mining: orange (I am not sure bout this, Anything else)
So I want to know is there anything else which I need to learn for that fields.
Or Some libraries which someone has tried. basically I want libraries which i can use from inside Django , I don’t want different frameworks for those.
In addition to the scikit cited by @aix, you might want to take a look at the following libraries:
I’ll really second investigating orange capabilities which is a full-featured application for data mining, but you can call it from external scripts, see e.g. the Beginning with Orange tutorial to get an idea.