• DocumentCode
    1135331
  • Title

    Speaker adaptation using an eigenphone basis

  • Author

    Kenny, Patrick ; Boulianne, Gilles ; Ouellet, Pierre ; Dumouchel, Pierre

  • Author_Institution
    Centre de Recherche Informatique de Montreal, Que., Canada
  • Volume
    12
  • Issue
    6
  • fYear
    2004
  • Firstpage
    579
  • Lastpage
    589
  • Abstract
    We describe a new method of estimating speaker-dependent hidden Markov models for speakers in a closed population. Our method differs from previous approaches in that it is based on an explicit model of the correlations between all of the speakers in the population, the idea being that if there is not enough data to estimate a Gaussian mean vector for a given speaker then data from other speakers can be used provided that we know how the speakers are correlated with each other. We explain how to estimate inter-speaker correlations using a Kullback-Leibler divergence minimization technique which can be applied to the problem of estimating the parameters of all of the hyperdistributions that are currently used in Bayesian speaker adaptation.
  • Keywords
    Gaussian processes; acoustic correlation; eigenvalues and eigenfunctions; hidden Markov models; maximum likelihood estimation; speech processing; speech recognition; Gaussian mean vector; Kullback-Leibler divergence minimization technique; MAP estimation; closed population; eigenphone basis; eigenvoices; hidden Markov models; interspeaker correlation estimation; intraspeaker correlations; maximum a posteriori estimation; multispeaker recognition experiments; parameter estimation; speaker adaptation; Bayesian methods; Covariance matrix; Data mining; Gaussian distribution; Hidden Markov models; Maximum likelihood estimation; Parameter estimation; Probability; Statistics; Training data; Eigenphones; eigenvoices; inter-speaker correlations; intra-speaker correlations; speaker adaptation;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2004.825668
  • Filename
    1344025