• DocumentCode
    2393634
  • Title
    Handset-dependent background models for robust text-independent speaker recognition
  • Author
    Heck, Larry P. ; Weintraub, Mitchel
  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1071
  • Abstract
    This paper studies the effects of handset distortion on telephone-based speaker recognition performance, resulting in the following observations: (1) the major factor in speaker recognition errors is whether the handset type (e.g., electret, carbon) is different across training and testing, not whether the telephone lines are mismatched, (2) the distribution of speaker recognition scores for true speakers is bimodal, with one mode dominated by matched handset tests and the other by mismatched handsets, (3) cohort-based normalization methods derive much of their performance gains from implicitly selecting cohorts trained with the same handset type as the claimant, and (4) utilizing a handset-dependent background model which is matched to the handset type of the claimant's training data sharpens and separates the true and false speaker score distributions. Results on the 1996 NIST Speaker Recognition Evaluation corpus show that using handset-matched background models reduces false acceptances (at a 10% miss rate) by more than 60% over previously reported (handset-independent) approaches.
  • Keywords
    cepstral analysis; error compensation; speaker recognition; telephone sets; 1996 NIST Speaker Recognition Evaluation corpus; bimodal distribution; cohort-based normalization methods; false speaker score distribution; handset distortion effects; handset-dependent background models; matched handset tests; mismatched handsets; performance gain; robust text-independent speaker recognition; telephone-based speaker recognition; training data; true speaker score distribution; Cepstral analysis; Degradation; Electrets; Laboratories; Loudspeakers; Robustness; Speaker recognition; Speech; Telephone sets; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.596126
  • Filename
    596126