DocumentCode :
3342231
Title :
Multiple-regression hidden Markov model
Author :
Fujinaga, Katsuhisa ; Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki
Author_Institution :
Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
Volume :
1
fYear :
2001
fDate :
2001
Firstpage :
513
Abstract :
Proposes a class of hidden Markov model (HMM) called multiple-regression HMM (MR-HMM) that utilizes auxiliary features such as fundamental frequency (F0) and speaking styles that affect spectral parameters to better model the acoustic features of phonemes. Though such auxiliary features are considered to be the factors that degrade the performance of speech recognizers, the proposed MR-HMM adapts its model parameters, i.e. mean vectors of output probability distributions, depending on these auxiliary information to improve the recognition accuracy. Formulation for parameter reestimation of MR-HMM based on the EM algorithm is given in the paper. Experiments of speaker-dependent isolated word recognition demonstrated that MR-HMMs using F0 based auxiliary features reduced the error rates by more than 20% compared with the conventional HMMs
Keywords :
hidden Markov models; parameter estimation; probability; speech recognition; statistical analysis; acoustic features; mean vectors; multiple-regression hidden Markov model; output probability distributions; parameter reestimation; phonemes; speaker-dependent isolated word recognition; speech recognizers; Cepstral analysis; Context modeling; Degradation; Error analysis; Frequency; Hidden Markov models; Humans; Maximum likelihood linear regression; Probability distribution; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
ISSN :
1520-6149
Print_ISBN :
0-7803-7041-4
Type :
conf
DOI :
10.1109/ICASSP.2001.940880
Filename :
940880
Link To Document :
بازگشت