• DocumentCode
    2176080
  • Title

    Well-calibrated heavy tailed Bayesian speaker verification for microphone speech

  • Author

    Senoussaoui, Mohammed ; Kenny, Patrick ; Dumouchel, Pierre ; Castaldo, Fabio

  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    4824
  • Lastpage
    4827
  • Abstract
    The work presented in this paper extends our two previous works. In the first paper, we proposed a low-dimensional feature (i-vector) extractor that is suitable for both telephone and microphone data of the NIST speaker recognition evaluation dataset. The second paper introduced the use of the Probabilistic Linear Discriminant Analysis (PLDA) framework with a heavy-tailed distribution for speaker verification. The advantage of PLDA is that it requires neither eigenchannel modeling nor score normalization. However, this approach is known to succeed only on telephone speech, not on microphone speech. We propose to overcome this drawback by using PLDA both as a second pass of front-end feature extraction and as a classifier. We present results on female speakers for the interview-interview condition of the NIST 2010 SRE. As measured by equal error rate (EER) and the NIST detection cost function (DCF), results with raw scores are 17% better than with score normalization. We have also calibrated our scores, achieving minimum and actual DCFs of 0.559 and 0.607, respectively.
  • Keywords
    belief networks; feature extraction; microphones; speaker recognition; DCF; EER; NIST speaker recognition evaluation dataset; PLDA framework; detection cost function; equal error rate; front-end feature extraction; microphone speech; probabilistic linear discriminant analysis framework; well-calibrated heavy tailed Bayesian speaker verification; Feature extraction; Gaussian distribution; Mel frequency cepstral coefficient; Microphones; NIST; Probabilistic logic; Speech; Probabilistic Linear Discriminant Analysis; Speaker verification; heavy tailed distribution; i-vectors
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947435
  • Filename
    5947435