• DocumentCode
    179015
  • Title

    Domain adaptation via within-class covariance correction in I-vector based speaker recognition systems

  • Author

    Glembek, O. ; Ma, Jiaxin ; Matejka, Pavel ; Bing Zhang ; Plchot, Oldrich ; Burget, Lukas ; Matsoukas, Spyros

  • Author_Institution
    Raytheon BBN Technol., Cambridge, MA, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    4032
  • Lastpage
    4036
  • Abstract
    In this paper we propose a technique of Within-Class Covariance Correction (WCC) for Linear Discriminant Analysis (LDA) in Speaker Recognition to perform an unsupervised adaptation of LDA to an unseen data domain, and/or to compensate for speaker population difference among different portions of LDA training dataset. The paper follows on the study of source-normalization and inter-database variability compensation techniques which deal with multimodal distribution of i-vectors. On the DARPA RATS (Robust Automatic Transcription of Speech) task, we show that, with two hours of unsupervised data, we improve the Equal-Error Rate (EER) by 17.5%, and 36% relative on the unmatched and semi-matched conditions, respectively. On the Domain Adaptation Challenge we show up to 70% relative EER reduction and we propose a data clustering procedure to identify the directions of the domain-based variability in the adaptation data.
  • Keywords
    compensation; speaker recognition; statistical analysis; DARPA RATS; I-vector based speaker recognition systems; LDA training dataset; WCC; adaptation data; domain adaptation; domain-based variability; equal-error rate; inter-database variability compensation techniques; linear discriminant analysis; multimodal distribution; relative EER reduction; robust automatic transcription of speech task; source-normalization; unsupervised adaptation; within-class covariance correction; Covariance matrices; Feature extraction; Rats; Speaker recognition; Speech; Speech processing; Training; LDA; i-vectors; inter-dataset variability compensation; source normalization; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854359
  • Filename
    6854359