• DocumentCode
    3528062
  • Title

    Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition

  • Author

    Zhu, Donglai ; Ma, Bin ; Li, Haizhou

  • Author_Institution
    Inst. for Infocomm Res., A*Star, Singapore
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4045
  • Lastpage
    4048
  • Abstract
    This paper extends our previous work on feature transformation-based support vector machines for speaker recognition by proposing a joint MAP adaptation of feature transformation (FT) and Gaussian Mixture Models (GMM) parameters. In the new approach, the prior probability density functions (PDFs) of FT and GMM parameters are jointly estimated using the background data under the maximum likelihood criteria. In this way, we derive a generic prior GMM that is more compact than the Universal Background Model due to the reduction of speaker variations. With the prior PDFs, we construct a supervector to characterize a speaker using FT and GMM parameters. We conducted experiments on NIST 2006 Speaker Recognition Evaluation (SRE06) data set. The results validated the effectiveness of the joint MAP adaptation approach.
  • Keywords
    Gaussian processes; maximum likelihood estimation; speaker recognition; support vector machines; Gaussian mixture model; feature transformation; joint MAP adaptation; maximum likelihood criteria; probability density functions; speaker recognition; support vector machines; Density functional theory; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Parameter estimation; Probability density function; Speaker recognition; Speech; Support vector machine classification; Support vector machines; feature transformation; maximum a posteriori; speaker recognition; support vector machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960516
  • Filename
    4960516