• DocumentCode
    1234315
  • Title

    A Study of Interspeaker Variability in Speaker Verification

  • Author

    Kenny, Patrick ; Ouellet, Pierre ; Dehak, Najim ; Gupta, Vishwa ; Dumouchel, Pierre

  • Author_Institution
    Centre de Rech. Inf. de Montreal, Montreal, QC
  • Volume
    16
  • Issue
    5
  • fYear
    2008
  • fDate
    7/1/2008 12:00:00 AM
  • Firstpage
    980
  • Lastpage
    988
  • Abstract
    We propose a new approach to the problem of estimating the hyperparameters which define the interspeaker variability model in joint factor analysis. We tested the proposed estimation technique on the NIST 2006 speaker recognition evaluation data and obtained 10%-15% reductions in error rates on the core condition and the extended data condition (as measured both by equal error rates and the NIST detection cost function). We show that when a large joint factor analysis model is trained in this way and tested on the core condition, the extended data condition and the cross-channel condition, it is capable of performing at least as well as fusions of multiple systems of other types. (The comparisons are based on the best results on these tasks that have been reported in the literature.) In the case of the cross-channel condition, a factor analysis model with 300 speaker factors and 200 channel factors can achieve equal error rates of less than 3.0%. This is a substantial improvement over the best results that have previously been reported on this task.
  • Keywords
    Gaussian processes; speaker recognition; Gaussian mixture model; channel factors; cross-channel condition; interspeaker variability; joint factor analysis; speaker factors; speaker recognition; speaker verification; Channel factors; Gaussian mixture model (GMM); speaker factors; speaker verification;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2008.925147
  • Filename
    4531370