DocumentCode
1135331
Title
Speaker adaptation using an eigenphone basis
Author
Kenny, Patrick ; Boulianne, Gilles ; Ouellet, Pierre ; Dumouchel, Pierre
Author_Institution
Centre de Recherche Informatique de Montreal, Que., Canada
Volume
12
Issue
6
fYear
2004
Firstpage
579
Lastpage
589
Abstract
We describe a new method of estimating speaker-dependent hidden Markov models for speakers in a closed population. Our method differs from previous approaches in that it is based on an explicit model of the correlations between all of the speakers in the population, the idea being that if there is not enough data to estimate a Gaussian mean vector for a given speaker then data from other speakers can be used provided that we know how the speakers are correlated with each other. We explain how to estimate inter-speaker correlations using a Kullback-Leibler divergence minimization technique which can be applied to the problem of estimating the parameters of all of the hyperdistributions that are currently used in Bayesian speaker adaptation.
Keywords
Gaussian processes; acoustic correlation; eigenvalues and eigenfunctions; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; Gaussian mean vector; Kullback-Leibler divergence minimization technique; MAP estimation; closed population; eigenphone basis; eigenvoices; hidden Markov models; interspeaker correlation estimation; intraspeaker correlations; maximum a posteriori estimation; multispeaker recognition experiments; parameter estimation; speaker adaptation; Bayesian methods; Covariance matrix; Data mining; Gaussian distribution; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Probability; Statistics; Training data; Eigenphones; eigenvoices; inter-speaker correlations; intra-speaker correlations; speaker adaptation;
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
ieee
ISSN
1063-6676
Type
jour
DOI
10.1109/TSA.2004.825668
Filename
1344025
Link To Document