• DocumentCode
    2368497
  • Title

    Efficient speaker verification system using speaker model clustering for T and Z normalizations

  • Author

    Ravulakollu, Kiran ; Apsingekar, Vijendra Raj ; De Leon, Phillip L.

  • Author_Institution
    Klipsch Sch. of Electr. Eng., New Mexico State Univ., Las Cruces, NM
  • fYear
    2008
  • fDate
    13-16 Oct. 2008
  • Firstpage
    56
  • Lastpage
    62
  • Abstract
    In speaker verification (SV) systems based on Gaussian mixture model-universal background model (GMM-UBM), normalization is an important component in the decision stage. Many normalization methods including the T- and Z-norms, have been proposed and investigated and these have contributed to state-of-the-art SV systems which have extremely low equal-error rates (EERs). In this paper, we consider application of both T- and Z-norms to a carefully selected subset of speakers using a data driven approach which can significantly reduce computation resulting in faster SV decisions and lower EER. Unfortunately, selection of the subset is critical and must be representative of the entire speaker model space otherwise error rates will increase. In order to properly select the subset of speakers for the normalizations, we propose a novel method which first clusters the speaker models using the K-means algorithm and the Kullback-Leibler (KL) divergence and then selects a set of speakers within the cluster. We evaluate the approach using both the TIMIT, NTIMIT and NIST-2002 corpora and compare against standard T- and Z-normalizations.
  • Keywords
    Gaussian processes; decision making; error analysis; pattern clustering; speaker recognition; Gaussian mixture model-universal background model; K-means algorithm; Kullback-Leibler divergence; T normalization; Z normalization; equal-error rates; speaker model clustering; speaker model space; speaker verification system; Clustering algorithms; Decision making; Error analysis; Parameter estimation; Support vector machine classification; Support vector machines; Testing; Uncertainty; Weight control; Working environment noise; Clustering methods; Speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Security Technology, 2008. ICCST 2008. 42nd Annual IEEE International Carnahan Conference on
  • Conference_Location
    Prague
  • Print_ISBN
    978-1-4244-1816-9
  • Electronic_ISBN
    978-1-4244-1817-6
  • Type

    conf

  • DOI
    10.1109/CCST.2008.4751277
  • Filename
    4751277