• DocumentCode
    454520
  • Title

    Hidden Semi-Markov Model Based Speech Recognition System using Weighted Finite-State Transducer

  • Author

    Oura, Keiichiro ; Zen, Heiga ; Nankaku, Yoshihiko ; Lee, Akinobu ; Tokuda, Keiichi

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Nagoya Inst. of Technol.
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    In hidden Markov models (HMMs), state duration probabilities decrease exponentially with time. It would be an inappropriate representation of temporal structure of speech. One of the solutions for this problem is integrating state duration probability distributions explicitly into the HMM. This form is known as a hidden semi-Markov model (HSMM). Although a number of attempts to use explicit duration models in speech recognition systems have been proposed, they are not consistent because various approximations were used in both training and decoding. In the present paper, a fully consistent speech recognition system based on the HSMM framework is proposed. In a speaker-dependent continuous speech recognition experiment, HSMM-based speech recognition system achieved about 5.9% relative error reduction over the corresponding HMM-based one
  • Keywords
    hidden Markov models; speech recognition; statistical distributions; HMM; hidden semi-Markov model; speech recognition system; state duration probability distributions; weighted finite-state transducer; Clustering algorithms; Computer science; Context modeling; Decoding; Hidden Markov models; Probability distribution; Speech recognition; State estimation; Statistical distributions; Transducers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1659950
  • Filename
    1659950