• DocumentCode
    3327155
  • Title

    Identifying in-set and out-of-set speakers using neighborhood information

  • Author

    Angkititrakul, Pongtep ; Hansen, John H L

  • Author_Institution
    Robust Speech Process. Group, Center for Spoken Language Res., Colorado Univ., Boulder, CO, USA
  • Volume
    1
  • fYear
    2004
  • fDate
    17-21 May 2004
  • Abstract
    We study the problem of identifying in-set and out-of-set speakers. The goal is to identify whether an unknown input speaker belongs to either a group of in-set speakers or an unseen out-of-set group. A state-of-the-art GMM classifier, with universal background model (UBM) and standard likelihood ratio test, is used as our baseline system. We propose an alternative hypothesis testing method that employs neighborhood information with respect to each in-set speaker model in the model space based on the Kullback-Leibier divergence. The Bayes factor is used in the verification stage (accept/reject hypothesis). We evaluate the proposed procedure on a clean CORPUS 1 set, and a noisy CORPUS 2 set which contains session-to-session variability. Experiments show an improvement in equal error rate for the system even when in-set speaker models are acoustically close in the model space, and as the size of the in-set speaker group increases.
  • Keywords
    Bayes methods; Gaussian processes; error statistics; speaker recognition; Bayes factor; GMM classifier; Kullback-Leibier divergence; equal error rate; hypothesis testing method; in-set speaker identification; neighborhood information; out-of-set speaker identification; standard likelihood ratio test; universal background model; Bayesian methods; Error analysis; Forensics; Loudspeakers; Natural languages; Robustness; Speaker recognition; Speech processing; System testing; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2004. Proceedings. (ICASSP '04). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8484-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2004.1326005
  • Filename
    1326005