Is there any tool that provides language recognition functionality?
Like if I input a sentence of English, it will suggest that the string may be English.
I think I need one of the following stuff.
- Python language recognition library.
- Web service that provides such functionality.
Anyone can help?
I have used the GuessLanguage module pretty extensively. It detects 100+ languages and works really well for LONG pieces of text. English is very good. Other languages I am not sure. If you want to process short sentences at a time or tweets it is pretty bad as it was designed for doing detection on full articles.
My company (social media analytics) is currently looking for a replacement package as well and are exploring NLTK as an option.