DocumentCode
417302
Title
Asynchronous HMM with applications to speech recognition
Author
Garg, Ashutosh ; Balakrishnan, Sreeram ; Vaithyanathan, Shivakumar
Author_Institution
Almaden Res. Center, San Jose, CA, USA
Volume
1
fYear
2004
fDate
17-21 May 2004
Abstract
We develop a novel formalism for modeling speech signals which are irregularly or incompletely sampled. This situation can arise in real world applications where the speech signal is being transmitted over an error prone channel where parts of the signal can be dropped. Typical speech systems based on hidden Markov models, cannot handle such data since HMMs rely on the assumption that observations are complete and made at regular intervals. We introduce the asynchronous HMM, a variant of the inhomogeneous HMM commonly used in bioinformatics, and show how it can be used to model irregularly or incompletely sampled data. A nested EM algorithm is presented in brief which can be used to learn the parameters of this asynchronous HMM. Evaluation on real world speech data, which has been modified to simulate channel errors, shows that this model and its variants significantly outperform the standard HMM and methods based on data interpolation.
Keywords
hidden Markov models; learning (artificial intelligence); optimisation; signal sampling; speech recognition; asynchronous HMM; data interpolation; error prone channel; hidden Markov models; incomplete sampling; inhomogeneous HMM; irregular sampling; nested EM algorithm; speech recognition; speech signal modeling; Bioinformatics; Computational modeling; Computer errors; Computer vision; Hidden Markov models; Interpolation; Random variables; Speech analysis; Speech recognition; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-8484-9
Type
conf
DOI
10.1109/ICASSP.2004.1326159
Filename
1326159
Link To Document