Title :
Large-scale audio feature extraction and SVM for acoustic scene classification
Author :
Geiger, Jurgen T. ; Schuller, Bjorn ; Rigoll, Gerhard
Author_Institution :
Inst. for Human-Machine Commun., Tech. Univ. Munchen, München, Germany
Abstract :
This work describes a system for acoustic scene classification using large-scale audio feature extraction. It is our contribution to the Scene Classification track of the IEEE AASP Challenge on Detection and Classification of Acoustic Scenes and Events (D-CASE). The system classifies 30 second long recordings of 10 different acoustic scenes. From the highly variable recordings, a large number of spectral, cepstral, energy and voicing-related audio features are extracted. Using a sliding window approach, classification is performed on short windows. SVM are used to classify these short segments, and a majority voting scheme is employed to get a decision for longer recordings. On the official development set of the challenge, an accuracy of 73 % is achieved. SVM are compared with a nearest neighbour classifier and an approach called Latent Perceptual Indexing, whereby SVM achieve the best results. A feature analysis using the t-statistic shows that mainly Mel spectra are the most relevant features.
Keywords :
audio signals; feature extraction; signal classification; statistical analysis; support vector machines; D-CASE; IEEE AASP; Mel spectra; SVM; acoustic scene classification; acoustic scenes; large-scale audio feature extraction; latent perceptual indexing; nearest neighbour classifier; scene classification track; short segments; sliding window approach; support vector machines; t-statistic; variable recordings; Accuracy; Feature extraction; Mel frequency cepstral coefficient; Support vector machines; Training; Training data; Computational auditory scene analysis; acoustic scene recognition; feature extraction;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
DOI :
10.1109/WASPAA.2013.6701857