What are the best libraries for doing this? And is the quality good enough to rely on? It will not be possible to train the system with the speakers' voices or use a dictionary of terms to improve results.
On Windows, you want to use SAPI (the Speech API). There are multiple implementations: Microsoft includes a free one with Windows, and Dragon NaturallySpeaking is a non-free one that I’ve seen used in the past for similar tasks (with effort). If the speakers are speaking clearly (and not overlapping or interrupting each other), the lack of training isn’t so crippling.
You won’t get a good transcript, though. The accuracy will be bad enough that what you get will be useful only for indexing. Long words and unique phrases will pop out nicely, especially if you create a custom dictionary (which I know you said isn’t possible). For instance, you could find all the news segments that mention ‘Pelosi’ and ‘public option’.
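To make the indexing idea concrete, here is a rough sketch in Python of what I mean. It assumes you already have the recognizer's raw text for each segment. The segment ids, the sample transcripts, and the helper names `build_index` and `find_segments` are all made up for illustration and are not part of SAPI or any speech library. The point is only that a plain word index still finds the distinctive terms even when the surrounding text is garbage.

```python
import re
from collections import defaultdict

def build_index(segments):
    """Map each lowercase word to the set of segment ids whose text contains it."""
    index = defaultdict(set)
    for seg_id, text in segments.items():
        for word in re.findall(r"[a-z']+", text.lower()):
            index[word].add(seg_id)
    return index

def find_segments(index, phrase):
    """Return ids of segments that contain every word of the phrase, in any order."""
    words = re.findall(r"[a-z']+", phrase.lower())
    if not words:
        return set()
    hits = set(index.get(words[0], set()))
    for word in words[1:]:
        hits &= index.get(word, set())
    return hits

# Hypothetical recognizer output, deliberately full of misrecognized words.
segments = {
    "seg1": "speaker pelosi said the public option is steel on the table",
    "seg2": "whether today son knee with a chants of rain",
    "seg3": "the public option was the bait it in the senate",
}
index = build_index(segments)

print(find_segments(index, "Pelosi"))
print(find_segments(index, "public option"))
```

Matching on individual words rather than exact phrases is deliberate here: with untrained recognition, the small words around a term are the ones most likely to come out wrong, so requiring an exact phrase match would lose hits.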