Title :
Asynchronous HMM with applications to speech recognition
Author :
Garg, Ashutosh ; Balakrishnan, Sreeram ; Vaithyanathan, Shivakumar
Author_Institution :
Almaden Res. Center, San Jose, CA, USA
Abstract :
We develop a novel formalism for modeling speech signals which are irregularly or incompletely sampled. This situation can arise in real world applications where the speech signal is being transmitted over an error prone channel where parts of the signal can be dropped. Typical speech systems based on hidden Markov models, cannot handle such data since HMMs rely on the assumption that observations are complete and made at regular intervals. We introduce the asynchronous HMM, a variant of the inhomogeneous HMM commonly used in bioinformatics, and show how it can be used to model irregularly or incompletely sampled data. A nested EM algorithm is presented in brief which can be used to learn the parameters of this asynchronous HMM. Evaluation on real world speech data, which has been modified to simulate channel errors, shows that this model and its variants significantly outperform the standard HMM and methods based on data interpolation.
Keywords :
hidden Markov models; learning (artificial intelligence); optimisation; signal sampling; speech recognition; asynchronous HMM; data interpolation; error prone channel; hidden Markov models; incomplete sampling; inhomogeneous HMM; irregular sampling; nested EM algorithm; speech recognition; speech signal modeling; Bioinformatics; Computational modeling; Computer errors; Computer vision; Hidden Markov models; Interpolation; Random variables; Speech analysis; Speech recognition; Text analysis;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326159