DocumentCode :
774743
Title :
Eigenvoice modeling with sparse training data
Author :
Kenny, Patrick ; Boulianne, Gilles ; Dumouchel, Pierre
Author_Institution :
Centre de Recherche Informatique de Montreal, Canada
Volume :
13
Issue :
3
fYear :
2005
fDate :
5/1/2005 12:00:00 AM
Firstpage :
345
Lastpage :
354
Abstract :
We derive an exact solution to the problem of maximum likelihood estimation of the supervector covariance matrix used in extended MAP (or EMAP) speaker adaptation and show how it can be regarded as a new method of eigenvoice estimation. Unlike other approaches to the problem of estimating eigenvoices in situations where speaker-dependent training is not feasible, our method enables us to estimate as many eigenvoices from a given training set as there are training speakers. In the limit as the amount of training data for each speaker tends to infinity, it is equivalent to cluster adaptive training.
Keywords :
covariance matrices; eigenvalues and eigenfunctions; maximum likelihood estimation; speaker recognition; cluster adaptive training; eigenvoice estimation; eigenvoice modeling; extended MAP speaker adaptation; maximum a posteriori estimation; maximum likelihood estimation; sparse training data; speech recognition; supervector covariance matrix; training set; training speaker; Covariance matrix; Eigenvalues and eigenfunctions; H infinity control; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Testing; Training data; Cluster adaptive training; eigenvoices; extended MAP (EMAP); speaker adaptation; speech recognition;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2004.840940
Filename :
1420369
Link To Document :
بازگشت