I have built an answer engine using wikipedia’s content with php. It just returns the most relevant wikipedia article for a particular query which in most of the cases turn out to be useles.The users tend to ask in a question format like “`what is the height of wayne rooney” and i want to give the exact height of rooney which is available in his wikipedia article. Can i solve this if i use nltk. Please advice.
I have built an answer engine using wikipedia’s content with php. It just returns
Share
in general: yes it is possible, BUT it is a very difficult task to build such a program.
what you are searching trying to build is called a ‘semantic search engine’ (see wikipedia) and there is a lot of research going on, how we can build a ‘semantic web’ and how to extract information of webpages, so that questions like the one you mentioned can be answered by computers, instead of just supplying links to relevant documents.
but the results are still far from perfect.
one of the better semantic search engines seems to be trueknowledge and of course the previously mentioned wolframalpha (which has its strengths in science).
if you really want to build such a semantic search engine the nltk might provide some helpful basic tools, but don’t expect it to be an easy task at all.