DocumentCode :
187664
Title :
The relevance of NIST speaker recognition evaluations
Author :
Asha, T. ; Murthy, Hema A.
Author_Institution :
Indian Inst. of Technol., Madras, Chennai, India
fYear :
2014
fDate :
22-25 July 2014
Firstpage :
1
Lastpage :
6
Abstract :
Feature extraction and building of the Universal Background Model (UBM) are crucial for building speaker verification/identification systems in the total variability subspace (TVS) framework. The motivation of this study is to analyze the significance of various parameters involved in front end processing for different databases. A number of different parameters like energy threshold for voice activity detection, the number of filters, the warping of the frequency scale, the number of cepstral coefficients and the shape of the filter are studied. Three different databases namely, NIST 2003, NIST 2010 and NTIMIT are studied. The optimal front-end obtained using NIST 2003 is observed to function well for NIST 2010 as conditions involving similar data was evaluated for both the databases. On the other hand, it is shown that the same optimal front-end is not scalable for NTIMIT database which is collected from a different environment. The experiments performed in this paper indicate that the optimal front-end parameters are specific to a particular dataset. In addition, mismatch between development data and evaluation data is shown to result in a poor system. Given the results, the paper questions the relevance of the NIST Speaker Recognition evaluations in real environments.
Keywords :
feature extraction; speaker recognition; NIST speaker recognition evaluations; NTIMIT database; TVS framework; UBM; cepstral coefficients; databases; feature extraction; speaker identification systems; speaker verification; total variability subspace; universal background model; voice activity detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Communications (SPCOM), 2014 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-4666-2
Type :
conf
DOI :
10.1109/SPCOM.2014.6983988
Filename :
6983988
Link To Document :
بازگشت