DocumentCode
3224379
Title
Novel wavelet-based pitch estimation and segmentation of non-stationary speech
Author
Charalampidis, Dimitrios ; Kura, Vijay B.
Author_Institution
Dept. of Electr. Eng., New Orleans Univ., LA, USA
Volume
2
fYear
2005
fDate
25-28 July 2005
Abstract
This paper introduces a novel method for accurate pitch estimation and speech segmentation, named multi-feature, autocorrelation (ACR) and wavelet technique (MAWT). MAWT uses feature extraction, and ACR applied on linear predictive coding (LPC) residuals, with a wavelet-based refinement step. MAWT opens the way for a unique approach to modeling: although speech is divided into segments, the success of voicing decisions is not crucial. Experiments demonstrate the superiority of MAWT in pitch period detection accuracy over existing methods, and illustrate its advantages for speech segmentation. These advantages are more pronounced for gain-varying and transitional speech, and under noisy conditions.
Keywords
correlation theory; feature extraction; linear predictive coding; speech coding; speech recognition; wavelet transforms; ACR; LPC; MAWT; feature extraction; linear predictive coding; multifeature-autocorrelation; nonstationary speech segmentation; pitch period detection accuracy; wavelet-based pitch estimation; Autocorrelation; Cepstrum; Feature extraction; Frequency; Linear predictive coding; Maximum likelihood estimation; Noise robustness; Signal processing; Speech coding; Speech enhancement; Pitch Detection; Speech Coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Fusion, 2005 8th International Conference on
Print_ISBN
0-7803-9286-8
Type
conf
DOI
10.1109/ICIF.2005.1592017
Filename
1592017
Link To Document