• DocumentCode
    3528754
  • Title

    iFLY system for the NIST 2008 speaker recognition evaluation

  • Author

    Guo, Wu ; Long, Yanhua ; Li, Yijie ; Pan, Lei ; Wang, Eryu ; Dai, Lirong

  • Author_Institution
    MOE-Microsoft Key Lab. of Multimedia Comput. & Commun., Univ. of Sci. & Technol. of China (USTC)
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    4209
  • Lastpage
    4212
  • Abstract
    This paper describes the iFLY system submitted to the NIST 2008 speaker recognition evaluation (SRE), where it achieved excellent performance. Our primary system is a fusion of two subsystems, GMM-UBM and GMM-SVM. For each subsystem, two kinds of short-time acoustic features, PLP and LPCC, are adopted. We focus on three key issues in this evaluation: channel compensation, multilingual or bilingual cues, and voice activity detection. We also point out that data selection and factor analysis play key roles in improving the system.
  • Keywords
    Gaussian processes; acoustic signal processing; speaker recognition; support vector machines; GMM-SVM; GMM-UBM; LPCC; PLP; iFLY system; short-time acoustic features; speaker recognition evaluation; Feature extraction; Frequency modulation; Laboratories; Microphones; NIST; Noise reduction; Speaker recognition; Speech analysis; Telephony; Testing; GMM; NAP; joint factor analysis; speaker verification
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4960557
  • Filename
    4960557