Adaptation of large vocabulary recognition system parameters

Author

Bahl, L. ; de Souza, P.V. ; Nahamoo, D. ; Picheny, M.A. ; Roukos, S.

Author_Institution

IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA

Volume

1

fYear

1992

fDate

23-26 Mar 1992

Firstpage

477

Abstract

The authors report on a series of experiments in which the hidden Markov model baseforms and the language model probabilities were updated from spontaneously dictated speech captured during recognition sessions with the IBM Tangora system. The basic technique for baseform modification consisted of constructing new fenonic baseforms for all recognized words. To modify the language model probabilities, a simplified version of a cache language model was implemented. The word error rate across six talkers was 3.7%. Baseform adaptation reduced the average error rate to 3.5%, and using the cache language model reduced the error rate to 3.2%. Combining both techniques further reduced the error rate to 3.1%-a respectable improvement over the original error rate, especially given that the system was speaker-trained prior to adaptation

Keywords

hidden Markov models; natural languages; speech recognition; vector quantisation; IBM Tangora system; cache language model; fenonic baseform construction; hidden Markov model baseforms; language model probabilities; large vocabulary recognition system parameters; prior speaker trained system; speaker adaptation; spontaneously dictated speech; word error rate; Error analysis; Hidden Markov models; Natural languages; Prototypes; Speech recognition; Statistics; Target recognition; Text recognition; Topology; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on

Conference_Location

San Francisco, CA

ISSN

1520-6149

Print_ISBN

0-7803-0532-9

Type

conf

DOI

10.1109/ICASSP.1992.225868

Filename

225868