• DocumentCode
    417266
  • Title

    Training for polynomial segment model using the expectation maximization algorithm

  • Author

    Li, Chak-Fai ; Siu, Man-Hung

  • Author_Institution
    Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., China
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    One of the difficulties in using the polynomial segment model (PSM) to capture the temporal correlations within a phonetic segment is the lack of an efficient training algorithm comparable with the Baum-Welch algorithm in HMM. In our previous paper, we introduced a recursive likelihood computation algorithm for PSM recognition which can perform Viterbi-style training. In this paper, we extend the recursive likelihood computation into a fast forward-backward PSM training algorithm that maximizes PSM likelihood. In addition, we introduce an improved PSM, dynamic multi-region PSM, that allows a data-driven alignment between observations and the segment trajectory. The dynamic multi-region PSM model outperforms HMM and traditional PSM in both phone classification and phone recognition tasks on the TIMIT corpus.
  • Keywords
    circuit optimisation; maximum likelihood estimation; pattern classification; recursive estimation; speech processing; speech recognition; TIMIT corpus; Viterbi-style training; data-driven alignment; dynamic multi-region PSM; expectation maximization algorithm; forward-backward PSM training algorithm; maximum likelihood estimation; observations; phone classification; phone recognition; phonetic segment; polynomial segment model; recursive likelihood computation; segment trajectory; speech recognition; temporal correlations; Computational complexity; Computer applications; Dynamic programming; Equations; Heuristic algorithms; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Polynomials; Viterbi algorithm;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326117
  • Filename
    1326117