• DocumentCode
    2875533
  • Title

    Four weightings and a fusion: a cepstral-SVM system for speaker recognition

  • Author

    Kajarekar, Sachin S.

  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA
  • fYear
    2005
  • fDate
    27-27 Nov. 2005
  • Firstpage
    17
  • Lastpage
    22
  • Abstract
    A new speaker recognition system is described that uses Mel-frequency cepstral features. This system is a combination of four support vector machines (SVMs). All the SVM systems use polynomial features and they are trained and tested independently using a linear inner-product kernel. Scores from each system are combined with equal weight to generate the final score. We evaluate the combined SVM system using extensive development sets with diverse recording conditions. These sets include NIST 2003, 2004 and 2005 speaker recognition evaluation datasets, and FISHER data. The results show that for 1-side training, the combined SVM system gives comparable performance to a system using cepstral features with a Gaussian mixture model (baseline), and combination of the two systems improves the baseline performance. For 8-side training, the combined SVM system is able to take advantage of more data and gives a 29% improvement over the baseline system
  • Keywords
    Gaussian processes; cepstral analysis; speaker recognition; support vector machines; Gaussian mixture model; Mel-frequency cepstral features; cepstral-SVM system; speaker recognition; support vector machines; Cepstral analysis; Databases; Kernel; Mel frequency cepstral coefficient; NIST; Polynomials; Speaker recognition; Speech; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
  • Conference_Location
    San Juan
  • Print_ISBN
    0-7803-9478-X
  • Electronic_ISBN
    0-7803-9479-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2005.1566506
  • Filename
    1566506