Title :
Eigenvoice modeling with sparse training data
Author :
Kenny, Patrick ; Boulianne, Gilles ; Dumouchel, Pierre
Author_Institution :
Centre de Recherche Informatique de Montreal, Canada
fDate :
5/1/2005 12:00:00 AM
Abstract :
We derive an exact solution to the problem of maximum likelihood estimation of the supervector covariance matrix used in extended MAP (or EMAP) speaker adaptation and show how it can be regarded as a new method of eigenvoice estimation. Unlike other approaches to the problem of estimating eigenvoices in situations where speaker-dependent training is not feasible, our method enables us to estimate as many eigenvoices from a given training set as there are training speakers. In the limit as the amount of training data for each speaker tends to infinity, it is equivalent to cluster adaptive training.
Keywords :
covariance matrices; eigenvalues and eigenfunctions; maximum likelihood estimation; speaker recognition; cluster adaptive training; eigenvoice estimation; eigenvoice modeling; extended MAP speaker adaptation; maximum a posteriori estimation; maximum likelihood estimation; sparse training data; speech recognition; supervector covariance matrix; training set; training speaker; Covariance matrix; Eigenvalues and eigenfunctions; H infinity control; Hidden Markov models; Loudspeakers; Maximum likelihood estimation; Principal component analysis; Speech recognition; Testing; Training data; Cluster adaptive training; eigenvoices; extended MAP (EMAP); speaker adaptation; speech recognition;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2004.840940