Input for Hidden Markov Model-based speech recognition program

333 Views Asked by Barsik the Cat At 20 December 2016 at 02:58

I am going to build a speech recognition program based on Hidden Markov Model. Unfortunately, I don't know how to get an input sound sequence, and, well, work with it. Can anyone tell me what is the general approach for reading values from a sound file format (i.e. .wav, .mp3, etc)and slicing a soundtrack into pieces in C++?

Original Q&A

There are 1 best solutions below

Dmytro Prylipko On 24 December 2016 at 21:11 BEST ANSWER

The general approach is to convert an input sound into the sequence of feature vectors (usually, MFCCs). This process is described in general in CMU Sphinx wiki, and described in details in HTK Book. You might also want to study the general-purpose openSMILE toolkit to see how it is done in C++.

Input for Hidden Markov Model-based speech recognition program

There are 1 best solutions below

Related Questions in C++

Related Questions in SPEECH-RECOGNITION

Related Questions in HIDDEN-MARKOV-MODELS

Trending Questions

Popular # Hahtags

Popular Questions