• DocumentCode
    2179896
  • Title

    Improved speaker recognition when using i-vectors from multiple speech sources

  • Author

    McLaren, Mitchell ; Van Leeuwen, David

  • Author_Institution
    Centre for Language & Speech Technol., Radboud Univ., Nijmegen, Netherlands
  • fYear
    2011
  • fDate
    22-27 May 2011
  • Firstpage
    5460
  • Lastpage
    5463
  • Abstract
    The concept of speaker recognition using i-vectors was recently introduced offering state-of-the-art performance. An i-vector is a compact representation of a speaker´s utterance after projection into a low-dimensional, total variability subspace trained using factor analysis. A secondary process involving linear discriminant analysis (LDA) is then used to improve the discrimination of i-vectors from different speakers. The newness of this technology invokes the question as to the best way to train the total variability subspace and LDA matrix when using speech collected from distinctly different sources. This paper presents a comparative study of a number of subspace training techniques and a novel source-normalised and-weighted LDA algorithm for the purpose of improving i-vector based speaker recognition under mis-matched evaluation conditions. Results from the NIST 2010 speaker recognition evaluation (SRE) suggest that accounting for source conditions in the LDA matrix as opposed to the total variability subspace training regime provides improved robustness to mis-matched evaluation conditions.
  • Keywords
    matrix algebra; speaker recognition; LDA matrix; NIST 2010 speaker recognition evaluation; SRE; i-vectors; improved speaker recognition; linear discriminant analysis matrix; multiple speech sources; source-normalised-and-weighted LDA algorithm; Interviews; Microphones; NIST; Robustness; Speaker recognition; Speech; Training; i-vector; linear discriminant analysis; source conditions; speaker recognition; total variability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
  • Conference_Location
    Prague
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4577-0538-0
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2011.5947594
  • Filename
    5947594