Title :
Comparison of vector normalization methods in multi-level speaker verification
Author :
Drgas, Szymon ; Dabrowski, Adam
Author_Institution :
Control & Syst. Eng., Poznan Univ. of Technol., Poznan, Poland
Abstract :
In this article a text-independent speaker verification problem is considered. After the feature extraction, each conversation side has been represented as a vector in a fixed dimensional space. In order to reduce an influence of the lengths of utterances and also the channel properties, various vector normalization techniques have been selected from the literature, modified, and tested. Additionally, it is shown that if the vectors are transformed in such a way that they lie on the unit hypersphere, better numerical properties of the respective kernel matrices can be achieved. Finally, various normalization methods as well for continuous features (i.e., the cosine kernel, the variance normalization, the cosine similarity merged with the variance normalization, and the variance normalization merged with the spherical normalization) as for the discrete features (again the cosine kernel, the TFLOG, the TFLOG merged with the cosine similarity metric, and the TFLOG merged with the spherical normalization) are tested and compared in this article.
Keywords :
feature extraction; matrix algebra; speaker recognition; channel property; feature extraction; fixed dimensional space; kernel matrices; multilevel speaker verification; numerical property; text-independent speaker verification problem; unit hypersphere; vector normalization methods; Accuracy; Feature extraction; Kernel; Speech; Training; Training data; Vectors;
Conference_Titel :
Signals and Electronic Systems (ICSES), 2012 International Conference on
Conference_Location :
Wroclaw
Print_ISBN :
978-1-4673-1710-8
Electronic_ISBN :
978-1-4673-1709-2
DOI :
10.1109/ICSES.2012.6382233