• DocumentCode
    3442401
  • Title

    Spectrogram-based formant tracking via particle filters

  • Author

    Shi, Yu ; Chang, Eric

  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    The paper presents a particle-filtering method for estimating formant frequencies of speech signals from spectrograms. First, frequency bands corresponding to the analyzed formants are extracted via a two-step dynamic programming based algorithm. A particle-filtering method is then used to locate accurately formants in every formant area based on the posterior PDF described by a set of support points with associated weights. Formant trajectories of voiced frames of a group of 81 utterances were manually tracked and labeled, partly for model training and partly for algorithm evaluation. In the experiments, the proposed method obtains average estimation errors of 72, 115, and 113 Hz for the first three formants, respectively, whereas the LPC based method induces 118, 172, and 250 Hz deviations. The experimental results show that the formants estimated by the proposed method are quite reliable and the trajectories are more accurate than LPC.
  • Keywords
    Monte Carlo methods; dynamic programming; frequency estimation; linear predictive coding; nonlinear filters; spectral analysis; speech processing; statistical analysis; tracking; LPC; dynamic programming; estimation errors; formant frequency estimation; formant tracking; formant trajectories; model training; nonlinear filters; particle filters; posterior PDF; sequential Monte Carlo methods; speech signal spectrograms; voiced frames; Algorithm design and analysis; Dynamic programming; Frequency estimation; Heuristic algorithms; Linear predictive coding; Particle filters; Particle tracking; Spectrogram; Speech; Trajectory;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198743
  • Filename
    1198743