DocumentCode :
3546254
Title :
A robust pitch estimation approach for colored noise-corrupted speech
Author :
Shahnaz, C. ; Zhu, W.P. ; Ahmad, M.O.
Author_Institution :
Dept. of Electr. & Comput. Eng., Concordia Univ., Montreal, Que., Canada
fYear :
2005
fDate :
23-26 May 2005
Firstpage :
3143
Abstract :
We present an integrated pitch estimation approach for speech severely corrupted by colored noise. An effective colored-noise whitening process is first applied to the noisy speech. A variable-length average magnitude difference function (VLAMDF) of the pre-filtered noisy speech (PFNS) is then proposed, which largely overcomes the trend of falling valleys in the conventional AMDF. The amplitude characteristic of the VLAMDF is reshaped by a simple linear transformation to reduce the possibility of double-pitch errors. Because the VLAMDF exhibits a valley where the autocorrelation function (ACF) of the PFNS provides a peak, the ACF is weighted by the reciprocal of the VLAMDF to emphasize the pitch candidate and to suppress the non-pitch peaks. Noise-robust pitch detection in the time domain is then obtained by combining this enhanced autocorrelation function with the reshaped version of the VLAMDF. The proposed approach is simulated using the Keele reference database and provides superior accuracy relative to some existing methods implemented in the presence of colored noise, even at a very low signal-to-noise ratio (SNR) of -15 dB.
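The weighting idea in the abstract (an ACF peak divided by an AMDF valley at the same lag) can be illustrated with a minimal sketch. This is not the authors' method: it uses a plain AMDF averaged over the overlapping samples at each lag rather than the VLAMDF, and it omits the noise whitening and the linear reshaping step. The function names, the `eps` regularizer, and the lag search range are all assumptions made for illustration.

```python
import math


def autocorrelation(frame, max_lag):
    """Raw (unnormalized) autocorrelation for lags 0..max_lag."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def amdf(frame, max_lag):
    """Average magnitude difference for lags 0..max_lag.

    Each lag is averaged over its own number of overlapping samples.
    This is a generic normalization, NOT the paper's VLAMDF.
    """
    n = len(frame)
    return [
        sum(abs(frame[i] - frame[i + lag]) for i in range(n - lag)) / (n - lag)
        for lag in range(max_lag + 1)
    ]


def estimate_pitch_lag(frame, min_lag, max_lag, eps=1e-3):
    """Return the lag in [min_lag, max_lag] that maximizes ACF / (AMDF + eps).

    eps is a hypothetical regularizer that avoids division by zero when
    the AMDF valley reaches zero on a perfectly periodic frame.
    """
    if not 0 < min_lag <= max_lag < len(frame):
        raise ValueError("need 0 < min_lag <= max_lag < len(frame)")
    acf = autocorrelation(frame, max_lag)
    diff = amdf(frame, max_lag)
    weighted = [acf[lag] / (diff[lag] + eps) for lag in range(max_lag + 1)]
    return max(range(min_lag, max_lag + 1), key=lambda lag: weighted[lag])


# Usage: a clean synthetic frame with a 50-sample period plus a second harmonic.
fs = 8000  # assumed sampling rate in Hz
frame = [
    math.sin(2 * math.pi * i / 50) + 0.5 * math.sin(2 * math.pi * i / 25)
    for i in range(400)
]
lag = estimate_pitch_lag(frame, min_lag=20, max_lag=150)
print("pitch lag:", lag, "samples; pitch:", fs / lag, "Hz")
```

Dividing by the AMDF sharpens the true pitch peak because the AMDF is near zero only at lags where the waveform repeats, while spurious ACF peaks usually coincide with larger AMDF values.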
Keywords :
acoustic noise; parameter estimation; random noise; signal denoising; speech processing; time-domain analysis; Keele reference database; SNR; amplitude characteristic; autocorrelation function; colored noise-corrupted speech; colored noise-whitening process; double-pitch-errors; falling valleys; linear transformation; pitch estimation; pre-filtered noisy speech; signal-to-noise ratio; variable-length average magnitude difference function; Additive noise; Autocorrelation; Colored noise; Noise robustness; Signal processing; Signal to noise ratio; Speech coding; Speech processing; Speech synthesis; Working environment noise
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Circuits and Systems, 2005. ISCAS 2005. IEEE International Symposium on
Print_ISBN :
0-7803-8834-8
Type :
conf
DOI :
10.1109/ISCAS.2005.1465294
Filename :
1465294