• DocumentCode
    3587724
  • Title

    Histogram transform model using MFCC features for text-independent speaker identification

  • Author

    Hong Yu ; Zhanyu Ma ; Minyue Li ; Jun Guo

  • Author_Institution
    Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2014
  • Firstpage
    500
  • Lastpage
    504
  • Abstract
    A novel text-independent speaker identification (SI) method is proposed in this paper. This method uses the mel-frequency cepstral coefficients (MFCCs) and the dynamic information among adjacent frames as feature set to capture the speaker´s characteristics. In order to utilize dynamic information, we design super MFCCs feature by cascading 3 neighboring MFCCs frames together. The probability density function (PDF) of these super MFCCs features is estimated by the recently proposed histogram transform (HT) method, which generated more training data by random transforms to realize the histogram PDF estimation and recede the discontinuity problem of the common multivariate histograms computing. Compared to the conventional PDF estimation method, such as Gaussian mixture model, the HT model shows promising improvement in a SI task.
  • Keywords
    cepstral analysis; probability; speaker recognition; transforms; HT method; common multivariate histogram computation; discontinuity problem; dynamic information; feature set; histogram PDF estimation; histogram transform model; mel-frequency cepstral coefficients; probability density function; random transforms; speaker characteristics; super MFCC feature design; super MFCC feature estimation; text-independent SI method; text-independent speaker identification; Feature extraction; Histograms; Mel frequency cepstral coefficient; Silicon; Speech; Training data; Transforms; Gaussian mixture model; Speaker identification; histogram transform model; mel-frequency cepstral coefficients;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signals, Systems and Computers, 2014 48th Asilomar Conference on
  • Print_ISBN
    978-1-4799-8295-0
  • Type

    conf

  • DOI
    10.1109/ACSSC.2014.7094494
  • Filename
    7094494