Title :
Adaptation of large vocabulary recognition system parameters
Author :
Bahl, L. ; de Souza, P.V. ; Nahamoo, D. ; Picheny, M.A. ; Roukos, S.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
Abstract :
The authors report on a series of experiments in which the hidden Markov model baseforms and the language model probabilities were updated from spontaneously dictated speech captured during recognition sessions with the IBM Tangora system. The basic technique for baseform modification consisted of constructing new fenonic baseforms for all recognized words. To modify the language model probabilities, a simplified version of a cache language model was implemented. The word error rate across six talkers was 3.7%. Baseform adaptation reduced the average error rate to 3.5%, and using the cache language model reduced the error rate to 3.2%. Combining both techniques further reduced the error rate to 3.1%-a respectable improvement over the original error rate, especially given that the system was speaker-trained prior to adaptation
Keywords :
hidden Markov models; natural languages; speech recognition; vector quantisation; IBM Tangora system; cache language model; fenonic baseform construction; hidden Markov model baseforms; language model probabilities; large vocabulary recognition system parameters; prior speaker trained system; speaker adaptation; spontaneously dictated speech; word error rate; Error analysis; Hidden Markov models; Natural languages; Prototypes; Speech recognition; Statistics; Target recognition; Text recognition; Topology; Vocabulary;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
Conference_Location :
San Francisco, CA
Print_ISBN :
0-7803-0532-9
DOI :
10.1109/ICASSP.1992.225868