DocumentCode
542331
Title
Modeling inverse covariance matrices by basis expansion
Author
Olsen, Peder A. ; Gopinath, Ramesh A.
Author_Institution
IBM, T. J. Watson Research Center, 134 and Taconic Parkway, Yorktown Heights, NY 10598, USA
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
This paper proposes a new covariance modeling technique for Gaussian Mixture Models. Specifically the inverse covariance (precision) matrix of each Gaussian is expanded in a rank-1 basis i.e., Σj −1 = Pj = Σk = 1 D λk jak ak T, λk j ∈ ℝd. A generalized EM algorithm is proposed to obtain maximum likelihood parameter estimates for the basis set {ak ak T} and the expansion coefficients {λk j}. This model, called the Extended Maximum Likelihood Linear Transform (EMLLT) model, is extremely flexible: by varying the number of basis elements from d to d(d + 1)/2 one gradually moves from a Maximum Likelihood Linear Transform (MLLT) model to a full-covariance model. Experimental results on two speech recognition tasks show that the EMLLT model can give relative gains of up to 35% in the word error rate over a standard diagonal covariance model.
Keywords
Acoustics; Computational modeling; Covariance matrix; Databases; Estimation; Hidden Markov models; Transforms;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743949
Filename
5743949
Link To Document