• DocumentCode
    573188
  • Title

    Discriminative sparse-based feature extraction and dictionary learning for sound classification applications

  • Author

    Seyedin, Sanaz ; Pichevar, Ramin ; Rouat, Jean

  • Author_Institution
    Dept. de Genie Electr. et de Genie Inf., Univ. de Sherbrooke, Sherbrooke, QC, Canada
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    1330
  • Lastpage
    1335
  • Abstract
    This paper presents a novel sparse-based classification algorithm for audio applications such as sound classification. We propose performing the sparse feature extraction, the dictionary learning, and classification processes simultaneously. This discriminative learning procedure for adapting the dictionaries and classifier to each specified audio task, instead of employing pre-defined dictionaries is the main novelty of our work. According to our experiments, applying this algorithm on some Mel-scale spectral features, such as MFCC (Mel Frequency Cepstral Coefficient), instead of raw temporal data can improve the accuracy and execution time significantly. Our proposed discriminative MFCC-sparse features when evaluated on real data consisting of five audio classes, substantially out-performed the non-discriminative ones. The lengths of test segments in our method are less than 0.5 second. This potential of usage for real-time applications is another advantage of our proposed approach.
  • Keywords
    audio signal processing; dictionaries; feature extraction; learning (artificial intelligence); signal classification; Mel frequency cepstral coefficient; Mel-scale spectral features; audio applications; classification processes; dictionaries; dictionary learning; discriminative MFCC-sparse features; discriminative learning; discriminative sparse-based feature extraction; execution time; sound classification; sparse feature extraction; sparse-based classification; temporal data; Accuracy; Dictionaries; Feature extraction; Harmonic analysis; Mel frequency cepstral coefficient; Speech; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310500
  • Filename
    6310500