• DocumentCode
    179003
  • Title

    Inter dataset variability compensation for speaker recognition

  • Author

    Aronowitz, Hagai

  • Author_Institution
    IBM Res. - Haifa Haifa, Haifa, Israel
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    4002
  • Lastpage
    4006
  • Abstract
    Recently satisfactory results have been obtained in NIST speaker recognition evaluations. These results are mainly due to accurate modeling of a very large development dataset provided by LDC. However, for many realistic scenarios the use of this development dataset is limited due to a dataset mismatch. In such cases, collection of a large enough dataset is infeasible. In this work we analyze the sources of degradation for a particular setup in the context of an i-vector PLDA system and conclude that the main source for degradation is an i-vector dataset shift. As a remedy, we introduce inter dataset variability compensation (IDVC) to explicitly compensate for dataset shift in the i-vector space. This is done using the nuisance attribute projection (NAP) method. Using IDVC we managed to reduce error dramatically by more than 50% for the domain mismatch setup.
  • Keywords
    probability; speaker recognition; vectors; IDVC; NAP; dataset mismatch; i-vector PLDA system; i-vector dataset shift; interdataset variability compensation; nuisance attribute projection; partial linear discriminant analysis; speaker recognition; Conferences; Mixers; NIST; Robustness; Speaker recognition; Tin; domain adaptation challenge; i-vector; inter dataset variability compensation; robust speaker recognition; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6854353
  • Filename
    6854353