DocumentCode :
3317101
Title :
Relative effectiveness of score normalization methods in speaker identification fusing acoustic and prosodic information
Author :
Zheng, Rong ; Zhang, Shuwu ; Xu, Bo
Author_Institution :
Inst. of Autom., Chinese Acad. of Sci., Beijing, China
fYear :
2005
fDate :
30 Oct.-1 Nov. 2005
Firstpage :
164
Lastpage :
168
Abstract :
In this paper, an investigation on the fusion of acoustic and prosodic information for GMM-UBM based text-independent speaker identification is presented. When acoustic and prosodic based systems are established, it is advantageous to normalize the dynamic ranges of the score dimensions, that is, likelihood scores from different quality of acoustic- and prosodic-based models. Score normalization methods, linear scaling to unit range and linear scaling to unit variance, are applied to transform the output scores using the background instances so as to obtain meaningful comparison between speaker models. In this fusion system based on linear score weighting approach, the performance of speaker identification is further improved when incorporating prosodic level of information. Experimental results on part of the NIST 1999 SRE corpus are reported.
Keywords :
Gaussian processes; acoustic signal processing; sensor fusion; speaker recognition; speech processing; GMM-UBM based text-independent speaker identification; acoustic information; linear scaling; linear score weighting approach; prosodic information fusion; score normalization methods; Acoustic testing; Automation; Cepstral analysis; Dynamic range; Frequency; Loudspeakers; NIST; Speaker recognition; Speech; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
Type :
conf
DOI :
10.1109/NLPKE.2005.1598727
Filename :
1598727
Link To Document :
بازگشت