Title :
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing
Author :
Kalinli, Ozlem ; Sundaram, Shiva ; Narayanan, Shrikanth
Author_Institution :
Dept. of Electr. Eng.-Syst., Univ. of Southern California, Los Angeles, CA, USA
Abstract :
Automatic acoustic scene classification of real life, complex and unstructured acoustic scenes is a challenging task as the number of acoustic sources present in the audio stream are unknown and overlapping in time. In this work, we present a novel approach to classification such unstructured acoustic scenes. Motivated by the bottom-up attention model of the human auditory system, salient events of an audio clip are extracted in an unsupervised manner and presented to the classification system. Similar to latent semantic indexing of text documents, the classification system uses unit-document frequency measure to index the clip in a continuous, latent space. This allows for developing a completely class-independent approach to audio classification. Our results on the BBC sound effects library indicates that using the saliency-driven attention selection approach presented in this paper, 17.5% relative improvement can be obtained in frame-based classification and 25% relative improvement can be obtained using the latent audio indexing approach.
Keywords :
acoustic signal processing; signal classification; audio classification; automatic acoustic scene classification; latent perceptual indexing; saliency-driven unstructured acoustic scene classification; unit-document frequency; Acoustic measurements; Auditory system; Extraterrestrial measurements; Frequency measurement; Humans; Indexing; Layout; Libraries; Measurement units; Streaming media;
Conference_Titel :
Multimedia Signal Processing, 2009. MMSP '09. IEEE International Workshop on
Conference_Location :
Rio De Janeiro
Print_ISBN :
978-1-4244-4463-2
Electronic_ISBN :
978-1-4244-4464-9
DOI :
10.1109/MMSP.2009.5293267