DocumentCode
774743
Title
Eigenvoice modeling with sparse training data
Author
Kenny, Patrick ; Boulianne, Gilles ; Dumouchel, Pierre
Author_Institution
Centre de Recherche Informatique de Montreal, Canada
Volume
13
Issue
3
fYear
2005
fDate
5/1/2005 12:00:00 AM
Firstpage
345
Lastpage
354
Abstract
We derive an exact solution to the problem of maximum likelihood estimation of the supervector covariance matrix used in extended MAP (or EMAP) speaker adaptation and show how it can be regarded as a new method of eigenvoice estimation. Unlike other approaches to the problem of estimating eigenvoices in situations where speaker-dependent training is not feasible, our method enables us to estimate as many eigenvoices from a given training set as there are training speakers. In the limit as the amount of training data for each speaker tends to infinity, it is equivalent to cluster adaptive training.
Keywords
covariance matrices; eigenvalues and eigenfunctions; maximum likelihood estimation; speaker recognition; cluster adaptive training; eigenvoice estimation; eigenvoice modeling; extended MAP speaker adaptation; maximum a posteriori estimation; maximum likelihood estimation; sparse training data; speech recognition; supervector covariance matrix; training set; training speaker; Covariance matrix; Eigenvalues and eigenfunctions; H infinity control; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Testing; Training data; Cluster adaptive training; eigenvoices; extended MAP (EMAP); speaker adaptation; speech recognition;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/TSA.2004.840940
Filename
1420369
Link To Document