Title :
A Robust pitch estimation approach for clean speech
Author :
Ben Messaoud, Mohamed Anouar ; Bouzid, A. ; Ellouze, Noureddine
Author_Institution :
Signal, Image & Pattern Recognition Lab., FST Le Belvedere, Tunis, Tunisia
Abstract :
In this work, we present an algorithm for estimating the fundamental frequency in speech signals. Our approach is based on the spectral compression by the autocorrelation of the speech multi-scale product analysis. It consists of operating the product of compressed copies of the original spectrum on the multi-scale product autocorrelation. The multi-scale product is based on making the product of the speech wavelet transform coefficients at three successive dyadic scales. The wavelet used is the quadratic spline function with a support of 0.8 ms. We estimate the pitch for each time frame based on its multi-scale product autocorrelation of the harmonic product spectrum structure. We evaluate our approach on the Keele database. Experimental results show the effectiveness of our method presenting a good performance surpassing the other algorithms.
Keywords :
speech processing; splines (mathematics); wavelet transforms; Keele database; autocorrelation method; clean speech; fundamental frequency estimation; quadratic spline function; robust pitch estimation; spectral compression; speech multiscale product analysis; speech wavelet transform coefficient; successive dyadic scale; Algorithm design and analysis; Correlation; Estimation; Harmonic analysis; Speech; Wavelet transforms; autocorrelation analysis; multi-scale product; pitch estimation; spectrum compression; speech;
Conference_Titel :
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-1657-6
DOI :
10.1109/SETIT.2012.6482009