Title :
Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization
Author :
Cauchi, Benjamin ; Lagrange, Mathieu ; Misdariis, Nicolas ; Cont, Arshia
Author_Institution :
Hearing, Speech & Audio Technol., FRAUNHOFER IDMT, Oldenburg, Germany
Abstract :
The modeling of auditory scenes is a challenging task in Computational Auditory Scene Analysis. A method based on sparse Non-negative Matrix Factorization that can be used with no prior knowledge of the audio content to establish the similarity between scenes is proposed in this work. It is then evaluated on a corpus of soundscapes of train stations from a perceptual study and results are compared with the human perception. The proposed method, by being able to focus on salient events within the scene, achieves better performances than a state-of-the-art Bag-of-Frames approach though not reaching the human performances.
Keywords :
acoustic signal processing; audio signal processing; matrix decomposition; acoustic scene; audio content; auditory scene modeling; computational auditory scene analysis; human perception; perceptual study; saliency based modeling; sparse nonnegative matrix factorization; train station; Analytical models; Computational modeling; Cost function; Dictionaries; Mel frequency cepstral coefficient; Vectors;
Conference_Titel :
Image Analysis for Multimedia Interactive Services (WIAMIS), 2013 14th International Workshop on
Conference_Location :
Paris
DOI :
10.1109/WIAMIS.2013.6616131