Regional Information Center for Science and Technology - Well-calibrated heavy tailed Bayesian speaker verification for microphone speech

DocumentCode :

2176080

Title :

Well-calibrated heavy tailed Bayesian speaker verification for microphone speech

Author :

Senoussaoui, Mohammed ; Kenny, Patrick ; Dumouchel, Pierre ; Castaldo, Fabio

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4824

Lastpage :

4827

Abstract :

This paper extends two of our previous works. In the first, we proposed a low-dimensional feature (i-vector) extractor suitable for both the telephone and microphone data of the NIST speaker recognition evaluation dataset. The second introduced a Probabilistic Linear Discriminant Analysis (PLDA) framework with a heavy-tailed distribution for speaker verification. The advantage of PLDA is that it requires neither eigenchannel modeling nor score normalization. However, this approach is known to succeed only on telephone speech, not on microphone speech. We propose to overcome this drawback by using PLDA as a second pass in front-end feature extraction as well as a classifier. We present results on female speakers for the interview-interview condition of the NIST 2010 SRE. As measured by equal error rate (EER) and the NIST detection cost function (DCF), results with raw scores are 17% better than with score normalization. We have also calibrated our scores, achieving a minimum DCF of 0.559 and an actual DCF of 0.607.

Keywords :

belief networks; feature extraction; microphones; speaker recognition; DCF; ERR; NIST speaker recognition evaluation dataset; PLDA framework; detection cost function; equal error rate; front-end feature extraction; microphone speech; probabilistic linear discriminant analysis framework; well-calibrated heavy tailed Bayesian speaker verification; Feature extraction; Gaussian distribution; Mel frequency cepstral coefficient; Microphones; NIST; Probabilistic logic; Speech; Probabilistic Linear Discriminant Analysis; Speaker verification; heavy tailed distribution; i-vectors;

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947435

Filename :

5947435

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2176080