• DocumentCode
    2697352
  • Title

    Speaker Segmentation and Clustering using Gender Information

  • Author

    Ore, Brian M. ; Slyh, Raymond E. ; Hansen, Eric G.

  • Author_Institution
    General Dynamics Adv. Inf. Syst., Dayton, OH
  • fYear
    2006
  • fDate
    28-30 June 2006
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    This paper considers the segmentation and clustering of conversational speech for the two-wire training (3conv2w) and two-wire testing (1conv2w) conditions of the NIST 2005 speaker recognition evaluation. A notable feature of the system described is that each file is labeled as containing either opposite- or same-gender speakers. The speech segments for opposite-gender files are clustered by gender, while those for same-gender files are processed by agglomerative clustering. By using gender information in the clustering of the opposite-gender files, the equal error rate in the 3conv2w training condition was reduced from 15.2% to 9.9%. For the 1conv2w testing condition, clustering opposite-gender files by gender did not improve performance over agglomerative clustering; however, it was over 100 times faster than agglomerative clustering on the opposite-gender files
  • Keywords
    pattern clustering; speaker recognition; 1conv2w; 3conv2w; NIST 2005; speaker clustering; speaker recognition; two-wire testing; two-wire training; Audio recording; Broadcasting; Humans; Information systems; Laboratories; Maximum likelihood detection; NIST; Speaker recognition; Speech processing; System testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
  • Conference_Location
    San Juan
  • Print_ISBN
    1-424400471-1
  • Electronic_ISBN
    1-4244-0472-X
  • Type

    conf

  • DOI
    10.1109/ODYSSEY.2006.248125
  • Filename
    4013542