• DocumentCode
    3492236
  • Title

    Segmental intensity and HMM modeling

  • Author

    Dumouchel, P. ; Shaughnessy, D. O´

  • Author_Institution
    INRS-Telecommun., Quebec Univ., Verdun, Que., Canada
  • Volume
    2
  • fYear
    1995
  • fDate
    5-8 Sep 1995
  • Firstpage
    995
  • Abstract
    We propose to use a stochastic segmental intensity model independent of the HMM model in INRS´s large vocabulary continuous speech recognizer. First, we examine how to insert this model into the search algorithm without violating the optimality constraints of this algorithm. Second, we propose and test the performance of four different intensity models. The training and testing of the models is done on a studio quality speaker-dependent speech corpus. The first model is a Gaussian mixture phone intensity model independent of the phonemic context. The second model is a Gaussian mixture phone intensity model dependent on the right or left phoneme context. The third model is a Gaussian mixture intensity model based on the variation of intensity within a diphone. Finally, the last model consists of a stochastic silence-speech detector. Performance comparisons show that the best model uses Gaussian mixture of the variation of intensity within a diphone (third model). This model improves the percentage of word recognition from 89.58% (no intensity modeling) to 90.92%
  • Keywords
    Gaussian processes; hidden Markov models; speech recognition; Gaussian mixture phone intensity model; HMM model; continuous speech recognition; diphone; large vocabulary; optimality constraints; phoneme context; search algorithm; stochastic segmental intensity model; stochastic silence-speech detector; studio quality speaker-dependent speech corpus; testing; training; word recognition; Automata; Business; Context modeling; Detectors; Hidden Markov models; Speech recognition; Stochastic processes; Stress; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical and Computer Engineering, 1995. Canadian Conference on
  • Conference_Location
    Montreal, Que.
  • ISSN
    0840-7789
  • Print_ISBN
    0-7803-2766-7
  • Type

    conf

  • DOI
    10.1109/CCECE.1995.526596
  • Filename
    526596