I want my program to know whether each word in a string is a noun, verb, adjective, etc. The string also needs to be tokenized, but how do I do that?
Do I need a database of word definitions, or should I use a context-free grammar?
Thank you.
It is much more complicated than that.
The answer to your immediate question is both. You look up words to see which parts of speech they can be, and you also analyze sentence structure to determine which part of speech applies in context.
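To make the "both" concrete, here is a minimal sketch in Python of the two steps working together: a lookup table for candidate parts of speech, plus one context rule for ambiguous words. Everything in it is an assumption for illustration: the `LEXICON` contents, the tag names, and the single disambiguation rule are all made up, and a real tagger would use a full dictionary and far richer rules or a statistical model.

```python
import re

# Hypothetical toy lexicon: each word maps to the parts of speech it can take.
# A real system would pull this from a dictionary resource instead.
LEXICON = {
    "the": ["DET"],
    "a": ["DET"],
    "i": ["PRON"],
    "quick": ["ADJ"],
    "dog": ["NOUN"],
    "flight": ["NOUN"],
    "read": ["VERB"],
    "book": ["NOUN", "VERB"],  # ambiguous: "the book" vs. "book a flight"
}


def tokenize(text):
    """Split text into lowercase word tokens and single punctuation marks."""
    return re.findall(r"[a-z']+|[^\sa-z']", text.lower())


def tag(text):
    """Tag tokens by lexicon lookup, using the previous tag to resolve ambiguity."""
    tagged = []
    for token in tokenize(text):
        if not any(ch.isalpha() for ch in token):
            choice = "PUNCT"
        else:
            candidates = LEXICON.get(token, ["UNKNOWN"])
            prev_tag = tagged[-1][1] if tagged else None
            if len(candidates) == 1:
                choice = candidates[0]
            elif prev_tag in ("DET", "ADJ") and "NOUN" in candidates:
                # Illustrative context rule: after a determiner or adjective,
                # prefer the noun reading.
                choice = "NOUN"
            elif "VERB" in candidates:
                # Otherwise fall back to the verb reading.
                choice = "VERB"
            else:
                choice = candidates[0]
        tagged.append((token, choice))
    return tagged


print(tag("I read the book."))
print(tag("Book a flight"))
```

The word "book" comes out as a noun in the first sentence and a verb in the second, which is why a lookup table alone is not enough.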
Check out WordNet (http://wordnet.princeton.edu/) for looking up parts of speech.
I recommend GATE/ANNIE as an open source framework for NLP. Apache has UIMA, although I haven’t worked with it. I believe there is some compatibility between the two projects.