I need to test various famous classification methods like kNN, ID3 and … on a huge data-set of a project, and choose one for future use.
I have no limitation on language but performance and readable code both in learning and classification phase are very important.
therefore, I’m looking for a good library with following features:
- includes various classification methods
- high performance
- easily usable
any suggestions?
Take a look at RapidMiner which comes with a Java-API and graphical tools for data mining. The community edition is free, I think.
I used the predecessor of this tool/library as a student but do not have professional experience with it, though.