DocumentCode :
1066764
Title :
Speech Recognition Using Augmented Conditional Random Fields
Author :
Hifny, Yasser ; Renals, Steve
Author_Institution :
T.J. Watson Res. Center, IBM, Yorktown Heights, NY
Volume :
17
Issue :
2
fYear :
2009
Firstpage :
354
Lastpage :
365
Abstract :
Acoustic modeling based on hidden Markov models (HMMs) is employed by state-of-the-art stochastic speech recognition systems. Although HMMs are a natural choice to warp the time axis and model the temporal phenomena in the speech signal, their conditional independence properties limit their ability to model spectral phenomena well. In this paper, a new acoustic modeling paradigm based on augmented conditional random fields (ACRFs) is investigated and developed. This paradigm addresses some limitations of HMMs while maintaining many of the aspects which have made them successful. In particular, the acoustic modeling problem is reformulated in a data driven, sparse, augmented space to increase discrimination. Acoustic context modeling is explicitly integrated to handle the sequential phenomena of the speech signal. We present an efficient framework for estimating these models that ensures scalability and generality. In the TIMIT phone recognition task, a phone error rate of 23.0% was recorded on the full test set, a significant improvement over comparable HMM-based systems.
Keywords :
Markov processes; random processes; speech recognition; TIMIT phone recognition task; acoustic modeling based HMM; augmented conditional random field; hidden Markov model; speech recognition; stochastic process; temporal phenomena; Acoustic testing; Automatic speech recognition; Context modeling; Error analysis; Feature extraction; Hidden Markov models; Scalability; Speech recognition; Stochastic systems; System testing; Augmented conditional random fields (ACRFs); augmented spaces; discriminative compression; hidden Markov models (HMMs);
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2008.2010286
Filename :
4749472
Link To Document :
بازگشت