• DocumentCode
    2017786
  • Title

    Multi-feature combination for speaker recognition

  • Author

    Li, Zhi-Yi ; He, Liang ; Zhang, Wei-Qiang ; Liu, Jia

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2010
  • fDate
    Nov. 29 2010-Dec. 3 2010
  • Firstpage
    318
  • Lastpage
    321
  • Abstract
    Combination of different features has been proved to be a good method for improving performance in speech recognition. In speaker recognition (SRE), various features have also been developed to reflect complementary aspects of speaker´s characteristics. This paper proposed an effective multi-feature combination in speaker recognition. In order to avoid the “dimensionality disaster” and to delimit the redundant information, linear discriminant analysis (LDA) is used to reduce the high dimensionality of combined feature to be lower. Then feature-domain channel compensation is applied to improve the performance. In experiments, we use the popular short-term spectral Mel-frequency cepstral coefficients (MFCC) and novel spectro-temporal time-frequency cepstrum (TFC) to do feature combination followed by LDA and feature-domain latent factor analysis (fLFA) for channel compensation respectively. The experimental results on the NIST SRE2008 short2 telephone-short3 telephone test set show that the proposed multi-feature combination is an effective method to outperform both raw features.
  • Keywords
    regression analysis; speaker recognition; time-frequency analysis; channel compensation; feature domain latent factor analysis; linear discriminant analysis; melfrequency cepstral coefficient; speaker recognition; spectrotemporal time-frequency cepstrum; speech recognition; Covariance matrix; Feature extraction; Mel frequency cepstral coefficient; Mutual information; Speaker recognition; Speech; GMM; MFCC; TFC; multi-feature combination;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Chinese Spoken Language Processing (ISCSLP), 2010 7th International Symposium on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4244-6244-5
  • Type

    conf

  • DOI
    10.1109/ISCSLP.2010.5684885
  • Filename
    5684885