DocumentCode :
302303
Title :
Speaker normalization using efficient frequency warping procedures
Author :
Lee, Li ; Rose, Richard C.
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
353
Abstract :
In an effort to reduce the degradation in speech recognition performance caused by variation in vocal tract shape among speakers, a frequency warping approach to speaker normalization is investigated. A set of low complexity, maximum likelihood based frequency warping procedures have been applied to speaker normalization for a telephone based connected digit recognition task. This paper presents an efficient means for estimating a linear frequency warping factor and a simple mechanism for implementing frequency warping by modifying the filter-bank in mel-frequency cepstrum feature analysis. An experimental study comparing these techniques to other well-known techniques for reducing variability is described. The results showed that frequency warping was consistently able to reduce word error rate by 20% even for very short utterances
Keywords :
cepstral analysis; frequency estimation; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; degradation; efficient frequency warping procedures; filter-bank; linear frequency warping factor; maximum likelihood based frequency warping; mel-frequency cepstrum feature analysis; short utterances; speaker normalization; speech recognition performance; telephone based connected digit recognition task; vocal tract shape; word error rate; Cepstral analysis; Cepstrum; Degradation; Error analysis; Frequency estimation; Maximum likelihood estimation; Nonlinear filters; Shape; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.541105
Filename :
541105
Link To Document :
بازگشت