Handset-dependent background models for robust text-independent speaker recognition

Author

Heck, Larry P. ; Weintraub, Mitchel

Author_Institution

Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA

Volume

2

fYear

1997

fDate

21-24 Apr 1997

Firstpage

1071

Abstract

This paper studies the effects of handset distortion on telephone-based speaker recognition performance, resulting in the following observations: (1) the major factor in speaker recognition errors is whether the handset type (e.g., electret, carbon) is different across training and testing, not whether the telephone lines are mismatched, (2) the distribution of speaker recognition scores for true speakers is bimodal, with one mode dominated by matched handset tests and the other by mismatched handsets, (3) cohort-based normalization methods derive much of their performance gains from implicitly selecting cohorts trained with the same handset type as the claimant, and (4) utilizing a handset-dependent background model which is matched to the handset type of the claimant´s training data sharpens and separates the true and false speaker score distributions. Results on the 1996 NIST Speaker Recognition Evaluation corpus show that using handset-matched background models reduces false acceptances (at a 10% miss rate) by more than 60% over previously reported (handset-independent) approaches

Keywords

cepstral analysis; error compensation; speaker recognition; telephone sets; 1996 NIST Speaker Recognition Evaluation corpus; bimodal distribution; cohort-based normalization methods; false speaker score distribution; handset distortion effects; handset-dependent background models; matched handset tests; mismatched handsets; performance gain; robust text-independent speaker recognition; telephone-based speaker recognition; training data; true speaker score distribution; Cepstral analysis; Degradation; Electrets; Laboratories; Loudspeakers; Robustness; Speaker recognition; Speech; Telephone sets; Testing;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on

Conference_Location

Munich

ISSN

1520-6149

Print_ISBN

0-8186-7919-0

Type

conf

DOI

10.1109/ICASSP.1997.596126

Filename

596126