I am planning to write an application that converts speech to text on Linux. Are there any existing interfaces that I can extend? Or is there an existing application on Linux that already does this? Any input would be appreciated.
EDIT: The application I am planning to write should be able to convert every word we speak into text, not just yes/no answers.
Well, this is quite an undertaking. Since you haven't said what technology you want to use, here are some links:
Good luck. With more detail, we may be able to provide better answers. For example, there’s a big difference between ‘yes/no’ call-center-style recognition and even partial natural language understanding.