• DocumentCode
    323759
  • Title

    Training of subspace distribution clustering hidden Markov model

  • Author

    Mak, Brian ; Bocchieri, Enrico

  • Author_Institution
    AT&T Labs., Florham Park, NJ, USA
  • Volume
    2
  • fYear
    1998
  • fDate
    12-15 May 1998
  • Firstpage
    673
  • Abstract
    Levinson, Juang and Sondhi (1986), and Mak, Bocchieri, and E. Barnard (see Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 1997) presented novel subspace distribution clustering hidden Markov models (SDCHMMs) which can be converted from continuous density hidden Markov models (CDHMMs) by clustering subspace Gaussians in each stream over all models. Though such model conversion is simple and runs fast, it has two drawbacks: (1) it does not take advantage of the fewer model parameters in SDCHMMs-theoretically SDCHMMs may be trained with smaller amount of data; and, (2) it involves two separate optimization steps (first training CDHMMs, then clustering subspace Gaussians) and the resulting SDCHMMs are not guaranteed to be optimal. We show how SDCHMMs may be trained directly from less speech data if we have a priori knowledge of their architecture. On the ATIS task, a speaker-independent, context-independent (CI) 20-stream SDCHMM system trained using our novel SDCHMM reestimation algorithm with only 8 minutes of speech performs as well as a CDHMM system trained using conventional CDHMM reestimation algorithm with 105 minutes of speech
  • Keywords
    Gaussian distribution; hidden Markov models; parameter estimation; pattern recognition; speech recognition; 105 min; 8 min; ATIS task; CDHMM reestimation algorithm; SDCHMM reestimation algorithm; acoustic modelling; context-independent speech recognition; continuous density hidden Markov models; model conversion; optimization steps; speaker-independent system; speech data; subspace Gaussians clustering; subspace distribution clustering HMM; training; Gaussian processes; Hidden Markov models; Signal processing; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
  • Conference_Location
    Seattle, WA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-4428-6
  • Type

    conf

  • DOI
    10.1109/ICASSP.1998.675354
  • Filename
    675354