Title :
Estimation of the Instantaneous Pitch of Speech
Author :
Resch, Barbara ; Nilsson, Mattias ; Ekman, Anders ; Kleijn, W. Bastiaan
Author_Institution :
Sound & Image Process. Lab., R. Inst. of Technol., Stockholm
fDate :
3/1/2007 12:00:00 AM
Abstract :
An accurate estimation of the pitch is essential for many speech processing applications, such as speech synthesis, speech coding, and speech enhancement. A widely used assumption in most common pitch estimation methods is that pitch is constant over a segment of short duration. This assumption does not apply in reality and leads to inaccurate pitch estimates. In this paper, we present a method for continuous pitch estimation that is able to track fast changes. In the presented framework, the pitch is modeled by a B-spline expansion and optimized in a multistage procedure for increased robustness. The performance of the continuous optimization procedure is compared to state-of-the-art pitch estimation methods and is evaluated both for artificial speech-like signals with known pitch, and for real speech signals. The results of the experiments show that our method leads to a higher accuracy of the estimate of the pitch than state-of-the-art methods
Keywords :
optimisation; speech processing; splines (mathematics); B-spline expansion; continuous optimization procedure; continuous pitch estimation; instantaneous speech pitch estimation; speech coding; speech enhancement; speech processing; speech synthesis; Image processing; Laboratories; Optimization methods; Robustness; Speech analysis; Speech coding; Speech enhancement; Speech processing; Speech synthesis; Spline; Instantaneous pitch; pitch estimation; pitch- synchronous processing; splines;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2006.885242