DocumentCode
2532800
Title
Comparison of vector normalization methods in multi-level speaker verification
Author
Drgas, Szymon ; Dabrowski, Adam
Author_Institution
Control & Syst. Eng., Poznan Univ. of Technol., Poznan, Poland
fYear
2012
fDate
18-21 Sept. 2012
Firstpage
1
Lastpage
6
Abstract
In this article a text-independent speaker verification problem is considered. After the feature extraction, each conversation side has been represented as a vector in a fixed dimensional space. In order to reduce an influence of the lengths of utterances and also the channel properties, various vector normalization techniques have been selected from the literature, modified, and tested. Additionally, it is shown that if the vectors are transformed in such a way that they lie on the unit hypersphere, better numerical properties of the respective kernel matrices can be achieved. Finally, various normalization methods as well for continuous features (i.e., the cosine kernel, the variance normalization, the cosine similarity merged with the variance normalization, and the variance normalization merged with the spherical normalization) as for the discrete features (again the cosine kernel, the TFLOG, the TFLOG merged with the cosine similarity metric, and the TFLOG merged with the spherical normalization) are tested and compared in this article.
Keywords
feature extraction; matrix algebra; speaker recognition; channel property; feature extraction; fixed dimensional space; kernel matrices; multilevel speaker verification; numerical property; text-independent speaker verification problem; unit hypersphere; vector normalization methods; Accuracy; Feature extraction; Kernel; Speech; Training; Training data; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals and Electronic Systems (ICSES), 2012 International Conference on
Conference_Location
Wroclaw
Print_ISBN
978-1-4673-1710-8
Electronic_ISBN
978-1-4673-1709-2
Type
conf
DOI
10.1109/ICSES.2012.6382233
Filename
6382233
Link To Document