DocumentCode :
294524
Title :
Reducing word error rate on conversational speech from the Switchboard corpus
Author :
Jeanrenaud, P. ; Eide, E. ; Chaudhari, U. ; McDonough, J. ; Ng, K. ; Siu, M. ; Gish, H.
Author_Institution :
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
53
Abstract :
Speech recognition of conversational speech is a difficult task. The performance levels on the Switchboard corpus had been in the vicinity of 70% word error rate. In this paper, we describe the results of applying a variety of modifications to our speech recognition system and we show their impact on improving the performance on conversational speech. These modifications include the use of more complex models, trigram language models, and cross-word triphone models. We also show the effect of using additional acoustic training on the recognition performance. Finally, we present an approach to dealing with the abundance of short words, and examine how the variable speaking rate found in conversational speech impacts on the performance. Currently, the level of performance is at the vicinity of 50% error, a significant improvement over recent levels
Keywords :
error statistics; speech recognition; Switchboard corpus; acoustic training; complex models; conversational speech; cross-word triphone models; performance levels; reducing word error rate; short words; speech recognition; trigram language models; variable speaking rate; Air pollution; Error analysis; Performance analysis; Positron emission tomography; Speech recognition; Telephony; Testing; Vocabulary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479271
Filename :
479271
Link To Document :
بازگشت