• DocumentCode
    774743
  • Title

    Eigenvoice modeling with sparse training data

  • Author

    Kenny, Patrick ; Boulianne, Gilles ; Dumouchel, Pierre

  • Author_Institution
    Centre de Recherche Informatique de Montreal, Canada
  • Volume
    13
  • Issue
    3
  • fYear
    2005
  • fDate
    5/1/2005 12:00:00 AM
  • Firstpage
    345
  • Lastpage
    354
  • Abstract
    We derive an exact solution to the problem of maximum likelihood estimation of the supervector covariance matrix used in extended MAP (or EMAP) speaker adaptation and show how it can be regarded as a new method of eigenvoice estimation. Unlike other approaches to the problem of estimating eigenvoices in situations where speaker-dependent training is not feasible, our method enables us to estimate as many eigenvoices from a given training set as there are training speakers. In the limit as the amount of training data for each speaker tends to infinity, it is equivalent to cluster adaptive training.
  • Keywords
    covariance matrices; eigenvalues and eigenfunctions; maximum likelihood estimation; speaker recognition; cluster adaptive training; eigenvoice estimation; eigenvoice modeling; extended MAP speaker adaptation; maximum a posteriori estimation; maximum likelihood estimation; sparse training data; speech recognition; supervector covariance matrix; training set; training speaker; Covariance matrix; Eigenvalues and eigenfunctions; H infinity control; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Testing; Training data; Cluster adaptive training; eigenvoices; extended MAP (EMAP); speaker adaptation; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2004.840940
  • Filename
    1420369