Title :
Pitch Estimation Based on a Harmonic Sinusoidal Autocorrelation Model and a Time-Domain Matching Scheme
Author :
Shahnaz, Celia ; Zhu, Wei-Ping ; Ahmad, M. Omair
Author_Institution :
Dept. of Electr. & Electron. Eng., Bangladesh Univ. of Eng. & Technol., Dhaka, Bangladesh
Abstract :
In this paper, a method for the estimation of pitch from noise-corrupted speech observations based on extracting a pitch harmonic and the corresponding harmonic number is proposed. Starting from the harmonic representation of clean speech, a simple yet accurate harmonic sinusoidal autocorrelation (HSAC) model is first derived. By employing this HSAC model expressed in terms of the pitch harmonics of the clean speech, a new autocorrelation-domain least-squares fitting optimization technique is developed to extract a pitch harmonic from the noisy speech. Then, the harmonic number associated with the pitch harmonic is determined by maximizing an objective function formulated as an impulse-train weighted symmetric average magnitude sum function (SAMSF) of the noisy speech. The period of the impulse-train is governed by the estimated pitch harmonic and the maximization of the objective function is carried out through a time-domain matching of periodicity of the impulse-train with that of the SAMSF. An SAMSF-based pitch tracking scheme using dynamic programming is devised to obtain a smoothed pitch contour. In order to demonstrate the efficacy of the proposed method, simulations are conducted by considering naturally spoken speech signals in the presence of white or multi-talker babble noise at different signal-to-noise ratio (SNR) levels. A comprehensive evaluation of the pitch estimation results shows the superiority of the proposed method over some of the state-of-the-art methods under low levels of SNR.
Keywords :
correlation methods; dynamic programming; estimation theory; harmonic analysis; impulse noise; least squares approximations; signal processing; speech processing; time-domain analysis; white noise; HSAC model; SAMSF-based pitch tracking scheme; SNR levels; autocorrelation-domain least-squares fitting optimization technique; clean speech; comprehensive evaluation; dynamic programming; estimated pitch harmonic; harmonic number; harmonic representation; harmonic sinusoidal autocorrelation model; impulse-train weighted symmetric average magnitude sum function; maximization; multitalker babble noise; naturally spoken speech signals; noise-corrupted speech observations; noisy speech; objective function; periodicity; pitch estimation; pitch harmonic extraction; pitch harmonics; signal-to-noise ratio levels; smoothed pitch contour; state-of-the-art methods; time-domain matching scheme; white noise; Discrete cosine transforms; Estimation; Harmonic analysis; Noise; Noise measurement; Power harmonic filters; Speech; Autocorrelation; fundamental frequency; harmonic sinusoidal autocorrelation (HSAC) model; harmonics; impulse-train; low signal-to-noise ratio (SNR); pitch; speech analysis; symmetric average magnitude sum function (SAMSF);
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2161579