Title :
Multiple mixture segmental HMM and its applications
Author :
Xiang, Bing ; Berger, Toby
Author_Institution :
Sch. of Electr. Eng. & Comput. Eng., Cornell Univ., Ithaca, NY, USA
Abstract :
A multiple mixture segmental hidden Markov model (MMSHMM) is presented. This model is extended from the linear probabilistic-trajectory segmental HMM. Each segment is characterized by a linear trajectory with slope and mid-point parameters, and also the residual error covariances around the trajectory, so that both extra-segmental and intra-segmental variation are represented. Instead of modeling single distribution for each model parameter as earlier work, we use multiple mixture components for model parameters to represent the variability due to the variation within each speaker and also the differences between speakers. This model is evaluated on two applications. One is a phonetic classification task with TIMIT corpus, which shows that MMSHMM has advantages over conventional HMM. Another one is a speaker-independent keyword spotting task with the Road Rally database. By rescoring putative events hypothesized by a primary HMM keyword spotter, the experiments show that the performance is improved through distinguishing true hits from false alarms
Keywords :
hidden Markov models; parameter estimation; probability; speech recognition; Road Rally database; TIMIT corpus; extra-segmental variation; intra-segmental variation; linear probabilistic-trajectory segmental HMM; linear trajectory; multiple mixture segmental HMM; multiple mixture segmental hidden Markov model; phonetic classification task; residual error covariances; speaker-independent keyword spotting task; Application software; Databases; Error analysis; Gaussian distribution; Hidden Markov models; Humans; Probability distribution; Production systems; Speech recognition; Statistical distributions;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940879