• DocumentCode
    1445825
  • Title

    Subspace distribution clustering hidden Markov model

  • Author

    Bocchieri, Enrico ; Mak, Brian Kan-Wing

  • Author_Institution
    AT&T Labs.-Res., Florham Park, NJ, USA
  • Volume
    9
  • Issue
    3
  • fYear
    2001
  • fDate
    3/1/2001 12:00:00 AM
  • Firstpage
    264
  • Lastpage
    275
  • Abstract
    Most contemporary laboratory recognizers require too much memory to run, and are too slow for mass applications. One major cause of the problem is the large parameter space of their acoustic models. In this paper, we propose a new acoustic modeling methodology which we call subspace distribution clustering hidden Markov modeling (SDCHMM) with the aim of achieving much more compact acoustic models. The theory of SDCHMM is based on tying the parameters of a new unit, namely the subspace distribution, of continuous density hidden Markov models (CDHMMs). SDCHMMs can be converted from CDHMMs by projecting the distributions of the CDHMMs onto orthogonal subspaces, and then tying similar subspace distributions over all states and all acoustic models in each subspace, by exploiting the combinatorial effect of subspace distribution encoding, all original full-space distributions can be represented by combinations of a small number of subspace distribution prototypes. Consequently, there is a great reduction in the number of model parameters, and thus substantial savings in memory and computation. This renders SDCHMM very attractive in the practical implementation of acoustic models. Evaluation on the Airline Travel Information System (ATIS) task shows that in comparison to its parent CDHMM system, a converted SDCHMM system achieves seven- to 18-fold reduction in memory requirement for acoustic models, and runs 30%-60% faster without any loss of recognition accuracy
  • Keywords
    hidden Markov models; pattern clustering; speech recognition; ATIS task; Airline Travel Information System; CDHMM; SDCHMM; acoustic models; combinatorial effect; continuous density hidden Markov models; full-space distributions; memory requirement; orthogonal subspaces; subspace distribution clustering hidden Markov model; subspace distribution encoding; Associate members; Distributed computing; Encoding; Hidden Markov models; Information systems; Laboratories; Power system modeling; Prototypes; Speech; Training data;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.906000
  • Filename
    906000