مرکز منطقه ای اطلاع رساني علوم و فناوري - Text-independent speaker verification using utterance level scoring and covariance modeling

DocumentCode :

839074

Title :

Text-independent speaker verification using utterance level scoring and covariance modeling

Author :

Zilca, Ran D.

Author_Institution :

Res. & Dev. Div., Amdocs, Israel

Volume :

Issue :

fYear :

2002

fDate :

9/1/2002 12:00:00 AM

Firstpage :

363

Lastpage :

370

Abstract :

This paper describes a computationally simple method to perform text independent speaker verification using second order statistics. The suggested method, called utterance level scoring (ULS), allows one to obtain a normalized score using a single pass through the frames of the tested utterance. The utterance sample covariance is first calculated and then compared to the speaker covariance using a distortion measure. Subsequently, a distortion measure between the utterance covariance and the sample covariance of data taken from different speakers is used to normalize the score. Experimental results from the 2000 NIST speaker recognition evaluation are presented for ULS, used with different distortion measures, and for a Gaussian mixture model (GMM) system. The results indicate that ULS as a viable alternative to GMM whenever the computational complexity and verification accuracy needs to be traded.

Keywords :

Gaussian distribution; computational complexity; covariance analysis; speaker recognition; statistical analysis; GMM; Gaussian mixture model; NIST speaker recognition evaluation; computational complexity; computationally simple method; covariance modeling; distortion measure; distortion measures; normalized score; sample covariance; second order statistics; speaker covariance; text-independent speaker verification; utterance level scoring; verification accuracy; Computational complexity; Data mining; Distortion measurement; NIST; Probability density function; Radio access networks; Speaker recognition; Speech; Statistics; Testing;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/TSA.2002.803419

Filename :

1040260

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=839074