DocumentCode
3188540
Title
Evolution of the performance of automatic speech recognition algorithms in transcribing conversational telephone speech
Author
Padmanabhan, M. ; Saon, G. ; Zweig, G. ; Huang, J. ; Kingsbury, B. ; Mangu, L.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
3
fYear
2001
fDate
2001
Firstpage
1926
Abstract
Research in the speech recognition speech-to-text conversion area has been underway for a couple of decades, and a great deal of progress has been made in reducing the word error rate. In this paper, we attempt to summarize the state of the art in speech recognition algorithms. The algorithms we describe span the areas of lexicon design, feature extraction, classifier design, combination of hypotheses, and speaker adaptation of acoustic models. We will benchmark the algorithms on two main sources of speech, the first being Voicemail (conversational telephone speech from a single speaker) and the second being Switchboard (conversational telephone speech between two speakers). We also present the results of some cross-domain experiments which highlight the “brittleness” of speech recognition systems today and illustrates the need to focus research effort on improving cross-domain performance
Keywords
cepstral analysis; feature extraction; speech recognition; voice mail; MAP decoding; Mel cepstrum; Switchboard; Voicemail; acoustic models; automatic speech recognition algorithms; boosting; classifier design; combination of hypotheses; conversational telephone speech transcribing; cross-domain experiments; feature extraction; formant frequencies; lexicon design; performance evolution; speaker adaptation; spectral peaks; speech-to-text conversion; spontaneous speech; system brittleness; word error rate; Algorithm design and analysis; Automatic speech recognition; Databases; Error analysis; Loudspeakers; Speech recognition; Telephony; Testing; Vocabulary; Voice mail;
fLanguage
English
Publisher
ieee
Conference_Titel
Instrumentation and Measurement Technology Conference, 2001. IMTC 2001. Proceedings of the 18th IEEE
Conference_Location
Budapest
ISSN
1091-5281
Print_ISBN
0-7803-6646-8
Type
conf
DOI
10.1109/IMTC.2001.929536
Filename
929536
Link To Document