DocumentCode
302302
Title
A parametric approach to vocal tract length normalization
Author
Eide, Ellen ; Gish, Herbert
Author_Institution
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
346
Abstract
Differences in vocal tract size among individual speakers contribute to the variability of speech waveforms. The first-order effect of a difference in vocal tract length is a scaling of the frequency axis; a female speaker, for example, exhibits formants roughly 20% higher than the formants of from a male speaker, with the differences most severe in open vocal tract configurations. We describe a parametric method of normalisation which counteracts the effect of varied vocal tract length. The method is shown to be effective across a wide range of recognition systems and paradigms, but is particularly helpful in the case of a small amount of training data
Keywords
acoustic signal processing; parameter estimation; speech processing; speech recognition; Helmholtz resonator; female speaker; first-order effect; formants; frequency axis scaling; male speaker; open vocal tract configurations; parametric method; speech recognition systems; speech waveforms variability; training data; vocal tract length normalization; vocal tract size; Acoustic testing; Frequency; Integrated circuit modeling; Iterative decoding; Iterative methods; Loudspeakers; Maximum likelihood decoding; Maximum likelihood estimation; Speech recognition; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.541103
Filename
541103
Link To Document