DocumentCode
3161854
Title
A comparative study of discriminative training using non-uniform criteria for cross-layer acoustic modeling
Author
Weng, Chao ; Juang, Biing-Hwang
Author_Institution
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
4089
Lastpage
4092
Abstract
This work focuses on a comparative study of discriminative training using non-uniform criteria for cross-layer acoustic modeling. Two kinds of discriminative training (DT) frameworks, minimum classification error like (MCE-like) and minimum phone error like (MPE-like) DT frameworks, are augmented to allow the error cost embedding at the phoneme (model) level respectively. To facilitate this comparative study, we implement both augmented DT frameworks under the same umbrella, using the error cost derived from the same cross-layer confusion matrix. Experiments on a large vocabulary task WSJ0 demonstrated the effectiveness of both DT frameworks with the formulated non-uniform error cost embedded. Several preliminary investigations on the effect of the dynamic range of error cost are also presented.
Keywords
matrix algebra; speech recognition; comparative study; cross-layer acoustic modeling; cross-layer confusion matrix; discriminative training frameworks; large vocabulary task WSJ0; minimum classification error like DT frameworks; minimum phone error like DT frameworks; nonuniform criteria; nonuniform error cost embedded; speech recognition; Accuracy; Acoustics; Dynamic range; Hidden Markov models; Linear programming; Speech recognition; Training; cross-layer acoustic modeling; discriminative training; non-uniform error cost; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288817
Filename
6288817
Link To Document