مرکز منطقه ای اطلاع رساني علوم و فناوري - Unified stochastic engine (USE) for speech recognition

DocumentCode :

2022978

Title :

Unified stochastic engine (USE) for speech recognition

Author :

Huang, X. ; Belin, M. ; Alleva, E. ; Hwang, M.

Author_Institution :

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

Volume :

fYear :

1993

fDate :

27-30 April 1993

Firstpage :

636

Abstract :

A unified stochastic engine (USE) that jointly optimizes both acoustic and language models is presented. In the USE, not only can one iteratively adjust language probabilities to fit the given acoustic representations, but one can also adjust acoustic models (including feature representation) guided by language constraints. From the language modeling point of view, the USE makes it possible to encode acoustically confusable words in the language probabilities. From the acoustic modeling point of view, the language-constraint approach makes it possible to focus on acoustic words for which language models lack enough discrimination capacity. The authors report preliminary experimental results for Wall Street Journal continuous 5000-word speaker-independent dictation. The error rate is reduced from 7.3% to 6.9% with the proposed method.<>

Keywords :

coding errors; dictation; iterative methods; speech coding; speech recognition; stochastic automata; Wall Street Journal; acoustic models; discrimination capacity; error rate; feature representation; language constraints; language models; language probabilities; speaker-independent dictation; speech recognition; unified stochastic engine;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on

Conference_Location :

Minneapolis, MN, USA

ISSN :

1520-6149

Print_ISBN :

0-7803-7402-9

Type :

conf

DOI :

10.1109/ICASSP.1993.319386

Filename :

319386

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2022978