• DocumentCode
    2066974
  • Title

    Parallel Phone Recognizer based MLLR Speaker Recognition

  • Author

    Eryu Wang ; Wu Guo ; Lirong Dai

  • Author_Institution
    iFlytek Speech Lab., Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2008
  • fDate
    16-19 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The method that uses maximum-likelihood linear regression (MLLR) adaptation transformation as features for support vector machine (SVM) has been adopted in recent NIST Speaker Recognition Evaluation (SRE). It is attractive because it makes use of high-level information about the speakers, and it can complement the standard GMM-UBM system. The performance of the system will be affected by the phone recognizer, especially in multi-lingual contexts. In this paper, we use a multi language phone recognizer based MLLR-SVM system, which can deal with the language phone recognizer problem. This system is defined as parallel phone recognizer-MLLR (PPR-MLLR). It has simpler framework than existing MLLR methods and can achieve better performance. In the NIST SRE 06 1 conv4w-1 conv4w task, the system can achieve an EER of 5.44%. Furthermore, we can achieve an EER of 4.20% which is almost a 20% system performance improvement when combined with the cepstral GMM-UBM system.
  • Keywords
    maximum likelihood estimation; regression analysis; speaker recognition; support vector machines; GMM-UBM system; MLLR speaker recognition; NIST speaker recognition evaluation; maximum-likelihood linear regression adaptation transformation; multi-lingual contexts; parallel phone recognizer-MLLR; support vector machine; Cepstral analysis; Linear regression; Loudspeakers; Maximum likelihood linear regression; NIST; Natural languages; Speaker recognition; Speech; Support vector machines; Telephone sets;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2942-4
  • Electronic_ISBN
    978-1-4244-2943-1
  • Type

    conf

  • DOI
    10.1109/CHINSL.2008.ECP.91
  • Filename
    4730345