• DocumentCode
    2532800
  • Title

    Comparison of vector normalization methods in multi-level speaker verification

  • Author

    Drgas, Szymon ; Dabrowski, Adam

  • Author_Institution
    Control & Syst. Eng., Poznan Univ. of Technol., Poznan, Poland
  • fYear
    2012
  • fDate
    18-21 Sept. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this article a text-independent speaker verification problem is considered. After the feature extraction, each conversation side has been represented as a vector in a fixed dimensional space. In order to reduce an influence of the lengths of utterances and also the channel properties, various vector normalization techniques have been selected from the literature, modified, and tested. Additionally, it is shown that if the vectors are transformed in such a way that they lie on the unit hypersphere, better numerical properties of the respective kernel matrices can be achieved. Finally, various normalization methods as well for continuous features (i.e., the cosine kernel, the variance normalization, the cosine similarity merged with the variance normalization, and the variance normalization merged with the spherical normalization) as for the discrete features (again the cosine kernel, the TFLOG, the TFLOG merged with the cosine similarity metric, and the TFLOG merged with the spherical normalization) are tested and compared in this article.
  • Keywords
    feature extraction; matrix algebra; speaker recognition; channel property; feature extraction; fixed dimensional space; kernel matrices; multilevel speaker verification; numerical property; text-independent speaker verification problem; unit hypersphere; vector normalization methods; Accuracy; Feature extraction; Kernel; Speech; Training; Training data; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals and Electronic Systems (ICSES), 2012 International Conference on
  • Conference_Location
    Wroclaw
  • Print_ISBN
    978-1-4673-1710-8
  • Electronic_ISBN
    978-1-4673-1709-2
  • Type

    conf

  • DOI
    10.1109/ICSES.2012.6382233
  • Filename
    6382233