DocumentCode :
353724
Title :
Parameter optimization for vocal tract length normalization
Author :
Dognin, Pierre ; El-Jaroudi, Amro ; Billa, Jayadev
Author_Institution :
Dept. of Electr. Eng., Pittsburgh Univ., PA, USA
Volume :
3
fYear :
2000
fDate :
2000
Firstpage :
1767
Abstract :
This paper focuses on the optimization of model parameters for vocal tract length normalization (VTLN). For maximum likelihood (ML) based normalization techniques, the complexity of the VTL-models is a source of variation in system performance. An optimal complexity for the VTL-model that ensures best global word error rate is proposed. The choice of frequency warping factor also depends on the signal processing step of VTLN. A best set of parameters for the VTLN signal processing stage is proposed with extensive results for an optimal frequency range
Keywords :
computational complexity; error statistics; physiology; speech; VTL-models; VTLN; complexity; frequency warping factor; global word error rate; maximum likelihood based normalization; parameter optimization; signal processing step; system performance; vocal tract length normalization; Electronic mail; Error analysis; Frequency domain analysis; Frequency estimation; Loudspeakers; Performance loss; Signal processing; Speech processing; Speech recognition; System performance;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
Conference_Location :
Istanbul
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.862095
Filename :
862095
Link To Document :
بازگشت