DocumentCode
454520
Title
Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer
Author
Oura, Keiichiro ; Zen, Heiga ; Nankaku, Yoshihiko ; Lee, Akinobu ; Tokuda, Keiichi
Author_Institution
Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol.
Volume
1
fYear
2006
fDate
14-19 May 2006
Abstract
In hidden Markov models (HMMs), state duration probabilities decrease exponentially with time. It would be an inappropriate representation of temporal structure of speech. One of the solutions for this problem is integrating state duration probability distributions explicitly into the HMM. This form is known as a hidden semi-Markov model (HSMM). Although a number of attempts to use explicit duration models in speech recognition systems have been proposed, they are not consistent because various approximations were used in both training and decoding. In the present paper, a fully consistent speech recognition system based on the HSMM framework is proposed. In a speaker-dependent continuous speech recognition experiment, HSMM-based speech recognition system achieved about 5.9% relative error reduction over the corresponding HMM-based one
Keywords
hidden Markov models; speech recognition; statistical distributions; HMM; hidden semi-Markov model; speech recognition system; state duration probability distributions; weighted finite-state transducer; Clustering algorithms; Computer science; Context modeling; Decoding; Hidden Markov models; Probability distribution; Speech recognition; State estimation; Statistical distributions; Transducers;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1659950
Filename
1659950
Link To Document