DocumentCode
284607
Title
Adaptation of large vocabulary recognition system parameters
Author
Bahl, L. ; de Souza, P.V. ; Nahamoo, D. ; Picheny, M.A. ; Roukos, S.
Author_Institution
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Volume
1
fYear
1992
fDate
23-26 Mar 1992
Firstpage
477
Abstract
The authors report on a series of experiments in which the hidden Markov model baseforms and the language model probabilities were updated from spontaneously dictated speech captured during recognition sessions with the IBM Tangora system. The basic technique for baseform modification consisted of constructing new fenonic baseforms for all recognized words. To modify the language model probabilities, a simplified version of a cache language model was implemented. The word error rate across six talkers was 3.7%. Baseform adaptation reduced the average error rate to 3.5%, and using the cache language model reduced the error rate to 3.2%. Combining both techniques further reduced the error rate to 3.1%-a respectable improvement over the original error rate, especially given that the system was speaker-trained prior to adaptation
Keywords
hidden Markov models; natural languages; speech recognition; vector quantisation; IBM Tangora system; cache language model; fenonic baseform construction; hidden Markov model baseforms; language model probabilities; large vocabulary recognition system parameters; prior speaker trained system; speaker adaptation; spontaneously dictated speech; word error rate; Error analysis; Hidden Markov models; Natural languages; Prototypes; Speech recognition; Statistics; Target recognition; Text recognition; Topology; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location
San Francisco, CA
ISSN
1520-6149
Print_ISBN
0-7803-0532-9
Type
conf
DOI
10.1109/ICASSP.1992.225868
Filename
225868
Link To Document