DocumentCode :
863246
Title :
Modeling inverse covariance matrices by basis expansion
Author :
Olsen, Peder A. ; Gopinath, Ramesh A.
Author_Institution :
IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
Volume :
12
Issue :
1
fYear :
2004
Firstpage :
37
Lastpage :
46
Abstract :
This paper proposes a new covariance modeling technique for Gaussian mixture models. Specifically the inverse covariance (precision) matrix of each Gaussian is expanded in a rank-1 basis i.e., Σj-1=Pjk=1DλkjakakT, λkj∈R,ak∈Rd. A generalized EM algorithm is proposed to obtain maximum likelihood parameter estimates for the basis set {akakT}k=1D and the expansion coefficients {λkj}. This model, called the extended maximum likelihood linear transform (EMLLT) model, is extremely flexible: by varying the number of basis elements from D=d to D=d(d+1)/2 one gradually moves from a maximum likelihood linear transform (MLLT) model to a full-covariance model. Experimental results on two speech recognition tasks show that the EMLLT model can give relative gains of up to 35% in the word error rate over a standard diagonal covariance model, 30% over a standard MLLT model.
Keywords :
covariance matrices; matrix inversion; maximum likelihood estimation; speech recognition; EMLLT model; Gaussian mixture models; basis expansion; covariance modeling technique; density functions; extended maximum likelihood linear transform model; generalized EM algorithm; inverse covariance matrices; maximum likelihood parameter estimates; speech recognition; Context modeling; Covariance matrix; Density functional theory; Error analysis; Hidden Markov models; Inverse problems; Maximum likelihood estimation; Parameter estimation; Speech recognition; Symmetric matrices;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2003.819943
Filename :
1261270
Link To Document :
بازگشت