DocumentCode
1445825
Title
Subspace distribution clustering hidden Markov model
Author
Bocchieri, Enrico ; Mak, Brian Kan-Wing
Author_Institution
AT&T Labs.-Res., Florham Park, NJ, USA
Volume
9
Issue
3
fYear
2001
fDate
3/1/2001 12:00:00 AM
Firstpage
264
Lastpage
275
Abstract
Most contemporary laboratory recognizers require too much memory to run, and are too slow for mass applications. One major cause of the problem is the large parameter space of their acoustic models. In this paper, we propose a new acoustic modeling methodology which we call subspace distribution clustering hidden Markov modeling (SDCHMM) with the aim of achieving much more compact acoustic models. The theory of SDCHMM is based on tying the parameters of a new unit, namely the subspace distribution, of continuous density hidden Markov models (CDHMMs). SDCHMMs can be converted from CDHMMs by projecting the distributions of the CDHMMs onto orthogonal subspaces, and then tying similar subspace distributions over all states and all acoustic models in each subspace, by exploiting the combinatorial effect of subspace distribution encoding, all original full-space distributions can be represented by combinations of a small number of subspace distribution prototypes. Consequently, there is a great reduction in the number of model parameters, and thus substantial savings in memory and computation. This renders SDCHMM very attractive in the practical implementation of acoustic models. Evaluation on the Airline Travel Information System (ATIS) task shows that in comparison to its parent CDHMM system, a converted SDCHMM system achieves seven- to 18-fold reduction in memory requirement for acoustic models, and runs 30%-60% faster without any loss of recognition accuracy
Keywords
hidden Markov models; pattern clustering; speech recognition; ATIS task; Airline Travel Information System; CDHMM; SDCHMM; acoustic models; combinatorial effect; continuous density hidden Markov models; full-space distributions; memory requirement; orthogonal subspaces; subspace distribution clustering hidden Markov model; subspace distribution encoding; Associate members; Distributed computing; Encoding; Hidden Markov models; Information systems; Laboratories; Power system modeling; Prototypes; Speech; Training data;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/89.906000
Filename
906000
Link To Document