DocumentCode
3442401
Title
Spectrogram-based formant tracking via particle filters
Author
Shi, Yu ; Chang, Eric
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
The paper presents a particle-filtering method for estimating formant frequencies of speech signals from spectrograms. First, frequency bands corresponding to the analyzed formants are extracted via a two-step dynamic programming based algorithm. A particle-filtering method is then used to locate accurately formants in every formant area based on the posterior PDF described by a set of support points with associated weights. Formant trajectories of voiced frames of a group of 81 utterances were manually tracked and labeled, partly for model training and partly for algorithm evaluation. In the experiments, the proposed method obtains average estimation errors of 72, 115, and 113 Hz for the first three formants, respectively, whereas the LPC based method induces 118, 172, and 250 Hz deviations. The experimental results show that the formants estimated by the proposed method are quite reliable and the trajectories are more accurate than LPC.
Keywords
Monte Carlo methods; dynamic programming; frequency estimation; linear predictive coding; nonlinear filters; spectral analysis; speech processing; statistical analysis; tracking; LPC; dynamic programming; estimation errors; formant frequency estimation; formant tracking; formant trajectories; model training; nonlinear filters; particle filters; posterior PDF; sequential Monte Carlo methods; speech signal spectrograms; voiced frames; Algorithm design and analysis; Dynamic programming; Frequency estimation; Heuristic algorithms; Linear predictive coding; Particle filters; Particle tracking; Spectrogram; Speech; Trajectory;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198743
Filename
1198743
Link To Document