• DocumentCode
    2800255
  • Title

    A comparison of approaches for modeling prosodic features in speaker recognition

  • Author

    Ferrer, Luciana ; Scheffer, Nicolas ; Shriberg, Elizabeth

  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4414
  • Lastpage
    4417
  • Abstract
    Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracted over syllables obtained automatically from speech recognition output. The features are then transformed using a Fisher kernel, and speaker models are trained using support vector machines (SVMs). Recently, a simpler version of these features, based on pseudo-syllables was shown to perform well when modeled using joint factor analysis (JFA). In this work, we study the two modeling techniques for the simpler set of features. We show that, for these features, a combination of JFA systems for different sequence lengths greatly outperforms both original modeling methods. Furthermore, we show that the combination of both methods gives significant improvements over the best single system. Overall, a performance improvement of 30% in the detection cost function (DCF) with respect to the two previously published methods is achieved using very simple strategies.
  • Keywords
    feature extraction; speaker recognition; support vector machines; Fisher kernel; JFA; SVM; detection cost function; joint factor analysis; prosodic feature extraction modelling; speaker recognition; support vector machines; Automatic speech recognition; Data mining; Energy measurement; Feature extraction; Kernel; Laboratories; Polynomials; Speaker recognition; Speech recognition; Support vector machines; Joint Factor Analysis; Prosody; Speaker recognition; Support Vector Machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495632
  • Filename
    5495632