Title :
The third-order cumulant of speech signals with application to reliable pitch estimation
Author :
Nemer, Elias ; Goubran, Rafik ; Mahmoud, Samy
Author_Institution :
Nortel, Verdun, Que., Canada
Abstract :
This paper provides a formal framework for using the third-order statistics (TOS) of speech signals and presents a new method for estimating the pitch and making voicing decision using the 3rd-order cumulant of the LPC residual. Analytical expressions for the horizontal slice of the 3rd-order cumulant as well as the kurtosis of voiced speech are derived using the McAulay sinusoidal model (McAulay et al., 1986). The derivations demonstrate that the skewness of voiced speech is sufficiently distinct from that of Gaussian noise and can be used to aid in detecting voicing. It is also shown that the 3rd-order cumulant slice has distinct characteristics in terms of periodicity, phase and harmonic content and is a reliable candidate for estimating the pitch. Actual speech data is used to verify the derivations and experimental results using Gaussian and street noise are used to demonstrate the performance in noisy conditions
Keywords :
Gaussian noise; higher order statistics; linear predictive coding; parameter estimation; signal representation; speech processing; Gaussian noise; LPC residual; McAulay sinusoidal model; TOS; horizontal slice; kurtosis; performance; pitch estimation; skewness; speech signals; street noise; third-order cumulant; third-order statistics; voicing decision; Autocorrelation; Business; Gaussian noise; Gaussian processes; Higher order statistics; Linear predictive coding; Phase estimation; Signal processing; Speech analysis; Speech enhancement;
Conference_Titel :
Statistical Signal and Array Processing, 1998. Proceedings., Ninth IEEE SP Workshop on
Conference_Location :
Portland, OR
Print_ISBN :
0-7803-5010-3
DOI :
10.1109/SSAP.1998.739426