I’m designing and coding a knowledge based community sharing system (forum, Q&A, article sharing between students, professors and experts) in Java, for the web.
I need to use some data mining/text processing techniques/algorithms to analyse the discussions between experts and students (discussions are categorized using tags) and create proper notes and compilations on specific similar topics.
I’m not an expert regarding such algorithms or tools available. It’d be great if anyone can provide me with some pointers or explain how I can proceed with this problem.
Thanks!!
For categorization of articles you can use LSA (Latent Semantic Analysis) technique .
You can check these tools for text processing.
LingPipe : Tool kit for processing text.
Lucene : Text mining
Solr : Powerful text search tool