Title :
MMI training for continuous phoneme recognition on the TIMIT database
Author :
Kapadia, S. ; Valtchev, V. ; Young, S.J.
Author_Institution :
Eng. Dept., Cambridge Univ., UK
Abstract :
Experiences with a phoneme recognition system for the TIMIT database which uses multiple mixture continuous-density monophone HMMs (hidden Markov models) trained using MMI (maximum mutual information) is reported. A comprehensive set of results are presented comparing the ML (maximum likelihood) and MMI training criteria for both diagonal and full covariance models. These results using simple monophone HMMs show that clear performance gains are achieved by MMI training. These results are comparable with the best reported by others, including those which use context-dependent models. In addition, a number of performance and implementation issues which are crucial to successful MMI training are discussed.<>
Keywords :
hidden Markov models; learning (artificial intelligence); speech recognition; MMI training; TIMIT database; continuous phoneme recognition; diagonal covariance models; full covariance models; hidden Markov models; implementation; maximum mutual information; performance gains;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1993. ICASSP-93., 1993 IEEE International Conference on
Conference_Location :
Minneapolis, MN, USA
Print_ISBN :
0-7803-7402-9
DOI :
10.1109/ICASSP.1993.319349