DocumentCode :
2022978
Title :
Unified stochastic engine (USE) for speech recognition
Author :
Huang, X. ; Belin, M. ; Alleva, F. ; Hwang, M.
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Volume :
2
fYear :
1993
fDate :
27-30 April 1993
Firstpage :
636
Abstract :
A unified stochastic engine (USE) that jointly optimizes both acoustic and language models is presented. In the USE, not only can one iteratively adjust language probabilities to fit the given acoustic representations, but one can also adjust acoustic models (including feature representation) guided by language constraints. From the language modeling point of view, the USE makes it possible to encode acoustically confusable words in the language probabilities. From the acoustic modeling point of view, the language-constraint approach makes it possible to focus on acoustic words for which language models lack enough discrimination capacity. The authors report preliminary experimental results for Wall Street Journal continuous 5000-word speaker-independent dictation. The error rate is reduced from 7.3% to 6.9% with the proposed method.
Keywords :
coding errors; dictation; iterative methods; speech coding; speech recognition; stochastic automata; Wall Street Journal; acoustic models; discrimination capacity; error rate; feature representation; language constraints; language models; language probabilities; speaker-independent dictation; unified stochastic engine
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.1993.319386
Filename :
319386