DocumentCode
573188
Title
Discriminative sparse-based feature extraction and dictionary learning for sound classification applications
Author
Seyedin, Sanaz ; Pichevar, Ramin ; Rouat, Jean
Author_Institution
Dept. de Genie Electr. et de Genie Inf., Univ. de Sherbrooke, Sherbrooke, QC, Canada
fYear
2012
fDate
2-5 July 2012
Firstpage
1330
Lastpage
1335
Abstract
This paper presents a novel sparse-based classification algorithm for audio applications such as sound classification. We propose performing the sparse feature extraction, the dictionary learning, and classification processes simultaneously. This discriminative learning procedure for adapting the dictionaries and classifier to each specified audio task, instead of employing pre-defined dictionaries is the main novelty of our work. According to our experiments, applying this algorithm on some Mel-scale spectral features, such as MFCC (Mel Frequency Cepstral Coefficient), instead of raw temporal data can improve the accuracy and execution time significantly. Our proposed discriminative MFCC-sparse features when evaluated on real data consisting of five audio classes, substantially out-performed the non-discriminative ones. The lengths of test segments in our method are less than 0.5 second. This potential of usage for real-time applications is another advantage of our proposed approach.
Keywords
audio signal processing; dictionaries; feature extraction; learning (artificial intelligence); signal classification; Mel frequency cepstral coefficient; Mel-scale spectral features; audio applications; classification processes; dictionaries; dictionary learning; discriminative MFCC-sparse features; discriminative learning; discriminative sparse-based feature extraction; execution time; sound classification; sparse feature extraction; sparse-based classification; temporal data; Accuracy; Dictionaries; Feature extraction; Harmonic analysis; Mel frequency cepstral coefficient; Speech; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
Conference_Location
Montreal, QC
Print_ISBN
978-1-4673-0381-1
Electronic_ISBN
978-1-4673-0380-4
Type
conf
DOI
10.1109/ISSPA.2012.6310500
Filename
6310500
Link To Document