DocumentCode
3333863
Title
Connectionist speaker normalization and its applications to speech recognition
Author
Huang, X.D. ; Lee, K.F. ; Waibel, A.
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
1991
fDate
30 Sep-1 Oct 1991
Firstpage
357
Lastpage
366
Abstract
Speaker normalization may have a significant impact on both speaker-adaptive and speaker-independent speech recognition. In this paper, a codeword-dependent neural network (CDNN) is presented for speaker normalization. The network is used as a nonlinear mapping function to transform speech data between two speakers. The mapping function is characterized by two important properties. First, the assembly of mapping functions enhances overall mapping quality. Second, multiple input vectors are used simultaneously in the transformation. This not only makes full use of dynamic information but also alleviates possible errors in the supervision data. Large-vocabulary continuous speech recognition is chosen to study the effect of speaker normalization. Using speaker-dependent semi-continuous hidden Markov models, performance evaluation over 360 testing sentences from new speakers showed that speaker normalization significantly reduced the error rate from 41.9% to 5.0% when only 40 speaker-dependent sentences were used to estimate CDNN parameters
Keywords
hidden Markov models; neural nets; speech coding; speech recognition; codeword-dependent neural network; connectionist; continuous speech recognition; dynamic information; error; hidden Markov models; multiple input vectors; nonlinear mapping function; performance; speaker normalization; Application software; Assembly; Computer science; Error analysis; Hidden Markov models; Loudspeakers; Neural networks; Parameter estimation; Speech recognition; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Networks for Signal Processing [1991]., Proceedings of the 1991 IEEE Workshop
Conference_Location
Princeton, NJ
Print_ISBN
0-7803-0118-8
Type
conf
DOI
10.1109/NNSP.1991.239506
Filename
239506
Link To Document