• DocumentCode
    3311277
  • Title

    Independent component analysis for audio classification

  • Author

    Kamath, Sunil ; Ravindran, Sourabh ; Anderson, David V.

  • Author_Institution
    Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2004
  • fDate
    1-4 Aug. 2004
  • Firstpage
    352
  • Lastpage
    355
  • Abstract
    In this paper, we explore the performance gains achieved by performing independent component analysis (ICA) decomposition on speech features obtained from a model of the early auditory system. ICA projection achieves dimensionality reduction by reducing the redundancy in the feature set (the transformed features are statistically independent). Performance is evaluated for an audio classification environment using a Gaussian mixture model (GMM) classifier and compared against the classification performance of AdaBoost, a wide-margin boosting algorithm. The new features are compared with mel-frequency cepstral coefficients (MFCC) and perceptual linear prediction (PLP) features. We also show that the ICA transformation is well suited for dimensionality reduction of auditory system-inspired features and it significantly improves the classification accuracy.
  • Keywords
    Gaussian distribution; audio signal processing; cepstral analysis; feature extraction; independent component analysis; signal classification; AdaBoost wide-margin boosting algorithm; GMM classifier; Gaussian mixture model; ICA decomposition; ICA projection; MFCC; PLP features; audio classification; classification accuracy; dimensionality reduction; early auditory system model; feature extraction; feature set redundancy reduction; independent component analysis; mel-frequency cepstral coefficients; perceptual linear prediction; speech features; statistically independent transformed features; Auditory system; Image processing; Independent component analysis; Mel frequency cepstral coefficient; Performance gain; Principal component analysis; Signal processing; Sparse matrices; Speech analysis; Speech processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Signal Processing Workshop, 2004 and the 3rd IEEE Signal Processing Education Workshop. 2004 IEEE 11th
  • Print_ISBN
    0-7803-8434-2
  • Type

    conf

  • DOI
    10.1109/DSPWS.2004.1437974
  • Filename
    1437974