Title :
Training for polynomial segment model using the expectation maximization algorithm
Author :
Li, Chak-Fai ; Siu, Man-Hung
Author_Institution :
Dept. of Electr. & Electron. Eng., Hong Kong Univ. of Sci. & Technol., China
Abstract :
One of the difficulties in using the polynomial segment model (PSM) to capture the temporal correlations within a phonetic segment is the lack of an efficient training algorithm comparable with the Baum-Welch algorithm in HMM. In our previous paper, we introduced a recursive likelihood computation algorithm for PSM recognition which can perform Viterbi-style training. In this paper, we extend the recursive likelihood computation into a fast forward-backward PSM training algorithm that maximizes PSM likelihood. In addition, we introduce an improved PSM, dynamic multi-region PSM, that allows a data-driven alignment between observations and the segment trajectory. The dynamic multi-region PSM model outperforms HMM and traditional PSM in both phone classification and phone recognition tasks on the TIMIT corpus.
Keywords :
circuit optimisation; maximum likelihood estimation; pattern classification; recursive estimation; speech processing; speech recognition; TIMIT corpus; Viterbi-style training; data-driven alignment; dynamic multi-region PSM; expectation maximization algorithm; forward-backward PSM training algorithm; maximum likelihood estimation; observations; phone classification; phone recognition; phonetic segment; polynomial segment model; recursive likelihood computation; segment trajectory; speech recognition; temporal correlations; Computational complexity; Computer applications; Dynamic programming; Equations; Heuristic algorithms; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Polynomials; Viterbi algorithm;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
Print_ISBN :
0-7803-8484-9
DOI :
10.1109/ICASSP.2004.1326117