• DocumentCode
    454531
  • Title

    The Contribution of Cepstral and Stylistic Features to SRI´s 2005 NIST Speaker Recognition Evaluation System

  • Author

    Ferrer, Luciana ; Shriberg, Elizabeth ; Kajarekar, Sachin S. ; Stolcke, Andreas ; Sönmez, Kemal ; Venkataraman, Anand ; Bratt, Harry

  • Author_Institution
    SRI Int., Menlo Park, CA
  • Volume
    1
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Recent work in speaker recognition has demonstrated the advantage of modeling stylistic features in addition to traditional cepstral features, but to date there has been little study of the relative contributions of these different feature types to a state-of-the-art system. In this paper we provide such an analysis, based on SRI´s submission to the NIST 2005 speaker recognition evaluation. The system consists of 7 subsystems (3 cepstral 4 stylistic). By running independent N-way subsystem combinations for increasing values of N, we fines that (1) a monotonic pattern in the choice of the best N systems allows for the inference of subsystem importance; (2) the ordering of subsystems alternates between cepstral and stylistic; (3) syllable-based prosodic features are the strongest stylistic features, and (4) overall subsystem ordering depends crucially on the amount of training data (1 versus 8 conversation sides). Improvements over the baseline cepstral system, when all systems are combined, range from 47% to 67%, with larger improvements for the 8-side condition. These results provide direct evidence of the complementary contributions of cepstral and stylistic features to speaker discrimination
  • Keywords
    cepstral analysis; speaker recognition; SRI 2005 NIST; baseline cepstral system; speaker recognition evaluation system; stylistic features; subsystem importance inference; syllable-based prosodic features; Cepstral analysis; Computer science; Loudspeakers; NIST; Space technology; Speaker recognition; Speech; Telephony; Testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1659967
  • Filename
    1659967