DocumentCode :
3348383
Title :
Multiband statistical learning for f0 estimation in speech
Author :
Sha, Fei ; Burgoyne, J. Ashley ; Saul, Lawrence K.
Author_Institution :
Dept. of Comput. & Inf. Sci., Univ. of Pennsylvania, Philadelphia, PA, USA
Volume :
5
fYear :
2004
fDate :
17-21 May 2004
Abstract :
We investigate a simple algorithm that combines multiband processing and least squares fits to estimate f0 contours in speech. The algorithm is untraditional in several respects: it makes no use of FFTs or autocorrelation at the pitch period; it updates the pitch incrementally on a sample-by-sample basis; it avoids peak picking and does not require interpolation in time or frequency to obtain high resolution estimates; it works reliably, in real time, without the need for postprocessing to produce smooth contours. We show that a baseline implementation of the algorithm, though already quite accurate, is significantly improved by incorporating a model of statistical learning into its final stages. Model parameters are estimated from training data to minimize the likelihood of gross errors in f0, as well as errors in classifying voiced versus unvoiced speech. Experimental results on several databases confirm the benefits of statistical learning.
Keywords :
frequency estimation; learning (artificial intelligence); least squares approximations; minimisation; real-time systems; speech processing; statistical analysis; FFT; autocorrelation; f0 estimation; fundamental frequency estimation; interpolation; least squares fits; least squares method; multiband processing; multiband statistical learning; statistical learning; Autocorrelation; Databases; Flexible printed circuits; Frequency estimation; Interpolation; Least squares approximation; Parameter estimation; Speech processing; Statistical learning; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-8484-9
Type :
conf
DOI :
10.1109/ICASSP.2004.1327197
Filename :
1327197
Link To Document :
بازگشت