DocumentCode :
3224379
Title :
Novel wavelet-based pitch estimation and segmentation of non-stationary speech
Author :
Charalampidis, Dimitrios ; Kura, Vijay B.
Author_Institution :
Dept. of Electr. Eng., New Orleans Univ., LA, USA
Volume :
2
fYear :
2005
fDate :
25-28 July 2005
Abstract :
This paper introduces a novel method for accurate pitch estimation and speech segmentation, named multi-feature, autocorrelation (ACR) and wavelet technique (MAWT). MAWT uses feature extraction, and ACR applied on linear predictive coding (LPC) residuals, with a wavelet-based refinement step. MAWT opens the way for a unique approach to modeling: although speech is divided into segments, the success of voicing decisions is not crucial. Experiments demonstrate the superiority of MAWT in pitch period detection accuracy over existing methods, and illustrate its advantages for speech segmentation. These advantages are more pronounced for gain-varying and transitional speech, and under noisy conditions.
Keywords :
correlation theory; feature extraction; linear predictive coding; speech coding; speech recognition; wavelet transforms; ACR; LPC; MAWT; feature extraction; linear predictive coding; multifeature-autocorrelation; nonstationary speech segmentation; pitch period detection accuracy; wavelet-based pitch estimation; Autocorrelation; Cepstrum; Feature extraction; Frequency; Linear predictive coding; Maximum likelihood estimation; Noise robustness; Signal processing; Speech coding; Speech enhancement; Pitch Detection; Speech Coding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Fusion, 2005 8th International Conference on
Print_ISBN :
0-7803-9286-8
Type :
conf
DOI :
10.1109/ICIF.2005.1592017
Filename :
1592017
Link To Document :
بازگشت