Title :
Spectrogram-based formant tracking via particle filters
Author :
Shi, Yu ; Chang, Eric
Abstract :
The paper presents a particle-filtering method for estimating formant frequencies of speech signals from spectrograms. First, frequency bands corresponding to the analyzed formants are extracted via a two-step dynamic programming based algorithm. A particle-filtering method is then used to locate accurately formants in every formant area based on the posterior PDF described by a set of support points with associated weights. Formant trajectories of voiced frames of a group of 81 utterances were manually tracked and labeled, partly for model training and partly for algorithm evaluation. In the experiments, the proposed method obtains average estimation errors of 72, 115, and 113 Hz for the first three formants, respectively, whereas the LPC based method induces 118, 172, and 250 Hz deviations. The experimental results show that the formants estimated by the proposed method are quite reliable and the trajectories are more accurate than LPC.
Keywords :
Monte Carlo methods; dynamic programming; frequency estimation; linear predictive coding; nonlinear filters; spectral analysis; speech processing; statistical analysis; tracking; LPC; dynamic programming; estimation errors; formant frequency estimation; formant tracking; formant trajectories; model training; nonlinear filters; particle filters; posterior PDF; sequential Monte Carlo methods; speech signal spectrograms; voiced frames; Algorithm design and analysis; Dynamic programming; Frequency estimation; Heuristic algorithms; Linear predictive coding; Particle filters; Particle tracking; Spectrogram; Speech; Trajectory;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198743