• DocumentCode
    863246
  • Title

    Modeling inverse covariance matrices by basis expansion

  • Author

    Olsen, Peder A. ; Gopinath, Ramesh A.

  • Author_Institution
    IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA
  • Volume
    12
  • Issue
    1
  • fYear
    2004
  • Firstpage
    37
  • Lastpage
    46
  • Abstract
    This paper proposes a new covariance modeling technique for Gaussian mixture models. Specifically the inverse covariance (precision) matrix of each Gaussian is expanded in a rank-1 basis i.e., Σj-1=Pjk=1DλkjakakT, λkj∈R,ak∈Rd. A generalized EM algorithm is proposed to obtain maximum likelihood parameter estimates for the basis set {akakT}k=1D and the expansion coefficients {λkj}. This model, called the extended maximum likelihood linear transform (EMLLT) model, is extremely flexible: by varying the number of basis elements from D=d to D=d(d+1)/2 one gradually moves from a maximum likelihood linear transform (MLLT) model to a full-covariance model. Experimental results on two speech recognition tasks show that the EMLLT model can give relative gains of up to 35% in the word error rate over a standard diagonal covariance model, 30% over a standard MLLT model.
  • Keywords
    covariance matrices; matrix inversion; maximum likelihood estimation; speech recognition; EMLLT model; Gaussian mixture models; basis expansion; covariance modeling technique; density functions; extended maximum likelihood linear transform model; generalized EM algorithm; inverse covariance matrices; maximum likelihood parameter estimates; speech recognition; Context modeling; Covariance matrix; Density functional theory; Error analysis; Hidden Markov models; Inverse problems; Maximum likelihood estimation; Parameter estimation; Speech recognition; Symmetric matrices;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2003.819943
  • Filename
    1261270