• DocumentCode
    1754993
  • Title

    Factorized Sub-Space Estimation for Fast and Memory Effective I-vector Extraction

  • Author

    Cumani, Sandro ; Laface, Pietro

  • Author_Institution
    Dipt. di Autom. e Inf., Politec. di Torino, Turino, Italy
  • Volume
    22
  • Issue
    1
  • fYear
    2014
  • fDate
    Jan. 2014
  • Firstpage
    248
  • Lastpage
    259
  • Abstract
    Most of the state-of-the-art speaker recognition systems use a compact representation of spoken utterances referred to as i-vector. Since the “standard” i-vector extraction procedure requires large memory structures and is relatively slow, new approaches have recently been proposed that are able to obtain either accurate solutions at the expense of an increase of the computational load, or fast approximate solutions, which are traded for lower memory costs. We propose a new approach particularly useful for applications that need to minimize their memory requirements. Our solution not only dramatically reduces the memory needs for i-vector extraction, but is also fast and accurate compared to recently proposed approaches. Tested on the female part of the tel-tel extended NIST 2010 evaluation trials, our approach substantially improves the performance with respect to the fastest but inaccurate eigen-decomposition approach, using much less memory than other methods.
  • Keywords
    speaker recognition; approximate solutions; computational load; eigen-decomposition approach; factorized subspace estimation; memory effective I-vector extraction; memory requirements; speaker recognition systems; standard i-vector extraction; tel-tel extended NIST 2010 evaluation; Dictionaries; Memory management; Sparse matrices; Speech; Speech processing; Standards; Vectors; Dictionary; i-vector extraction; i-vectors; probabilistic linear discriminant analysis; speaker recognition;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2013.2290505
  • Filename
    6661348