Sign Up

Sign Up to our social questions and Answers Engine to ask questions, answer people’s questions, and connect with other people.

Have an account? Sign In

Have an account? Sign In Now

Sign In

Login to our social questions & Answers Engine to ask questions answer people’s questions & connect with other people.

Sign Up Here

Forgot Password?

Don't have account, Sign Up Here

Forgot Password

Lost your password? Please enter your email address. You will receive a link and will create a new password via email.

Have an account? Sign In Now

You must login to ask a question.

Forgot Password?

Need An Account, Sign Up Here

Please briefly explain why you feel this question should be reported.

Please briefly explain why you feel this answer should be reported.

Please briefly explain why you feel this user should be reported.

Sign InSign Up

The Archive Base

The Archive Base Logo The Archive Base Logo

The Archive Base Navigation

  • SEARCH
  • Home
  • About Us
  • Blog
  • Contact Us
Search
Ask A Question

Mobile menu

Close
Ask a Question
  • Home
  • Add group
  • Groups page
  • Feed
  • User Profile
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Buy Points
  • Users
  • Help
  • Buy Theme
  • SEARCH
Home/ Questions/Q 6905915
In Process

The Archive Base Latest Questions

Editorial Team
  • 0
Editorial Team
Asked: May 27, 20262026-05-27T08:14:33+00:00 2026-05-27T08:14:33+00:00

I am exploring the SpeechRecognitionEngine ‘s capabilities, and my end goal is to input

  • 0

I am exploring the SpeechRecognitionEngine‘s capabilities, and my end goal is to input a WAV file and a transcription of that WAV file, and to output the positions in the WAV file of the beginning (and ideally, end) of each word.

I can get the engine to recognize the phrase successfully, but I can not understand how to retrieve the audio positions when the word starts, not when the recognition was hypothesized or recognized, etc.

If you’re curious what the point of this is, it is in automating lipsync animation workflows.

Thanks for your time.

  • 1 1 Answer
  • 0 Views
  • 0 Followers
  • 0
Share
  • Facebook
  • Report

Leave an answer
Cancel reply

You must login to add an answer.

Forgot Password?

Need An Account, Sign Up Here

1 Answer

  • Voted
  • Oldest
  • Recent
  • Random
  1. Editorial Team
    Editorial Team
    2026-05-27T08:14:34+00:00Added an answer on May 27, 2026 at 8:14 am

    Proper audio to text alignment is a task which requires specific algorithms different from the speech recognition. You can emulate some alignment functionality with ASR engine, but it will work good.

    For the implementations of the alignment algorithms you can check CMUSphinx speech recognition toolkit:

    http://cmusphinx.sourceforge.net/?s=long+audio+alignment

    http://www.bluevincent.com/2011/02/speech-to-text-using-java.html

    Or you can try commercial company service like the one from Nexiwave

    http://nexiwave.com/index.php/applications/transcription-timestamping

    • 0
    • Reply
    • Share
      Share
      • Share on Facebook
      • Share on Twitter
      • Share on LinkedIn
      • Share on WhatsApp
      • Report

Sidebar

Related Questions

I'm exploring the file upload and text parsing capabilities of PHP, but my first
Exploring around S3's UI, it seems they only enjoy file uploads from my local
I just started exploring Scala in my free time. I have to say that
While exploring a recent Linq question i noticed that the algorithm seemed rather slow.
I’m exploring poll() function on a small project of mine, and I noticed that
I have been exploring algorithms that require some work on matrices, and I have
I'm exploring CouchDB this morning, and am playing with a document schema that looks
while exploring jQuery I came up with the following weird script. I don't see
I'm exploring the possibility of writing an application in Erlang, but it would need
I've been exploring different strategies for running integration tests within some Nant build scripts.

Explore

  • Home
  • Add group
  • Groups page
  • Communities
  • Questions
    • New Questions
    • Trending Questions
    • Must read Questions
    • Hot Questions
  • Polls
  • Tags
  • Badges
  • Users
  • Help
  • SEARCH

Footer

© 2021 The Archive Base. All Rights Reserved
With Love by The Archive Base

Insert/edit link

Enter the destination URL

Or link to existing content

    No search term specified. Showing recent items. Search or use up and down arrow keys to select an item.