Title :
Discriminative Acoustic Event Recognition in multimedia recordings
Author :
Khanwalkar, Saurabh ; Saikumar, Guruprasad ; Srivastava, Amit ; Natarajan, Premkumar
Author_Institution :
Raytheon BBN Technol., Cambridge, MA, USA
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
In this paper, we describe an Acoustic Event Recognition (AER) system for locating events of interest in the audio stream of multimedia recordings. We focus on two non-speech acoustic events; bomb explosions and gunfire, which typically exist in surveillance videos and are of importance in monitoring and alerting applications. Recognition is performed using a discriminative approach based on Support Vector Machines (SVM). We compare the new approach to a baseline system that utilizes a Hidden Markov Model (HMM)-based classification approach. We performed experiments on a corpus of publicly available video files containing gunfire and explosion events. Our results show that the new discriminative approach, when configured to use a rich combination of acoustic features, achieves a high retrieval precision at a notable recall under noisy conditions. As compared to HMM-based system, we achieved 54% relative improvement in F-score for explosion recognition with 1.5% relative improvement in F-score for gunfire recognition.
Keywords :
audio streaming; hidden Markov models; image classification; image recognition; multimedia communication; multimedia computing; support vector machines; video surveillance; AER system; F-score recognition; HMM-based classification approach; SVM; audio streaming; bomb explosion; discriminative acoustic event recognition system; gunfire recognition; hidden Markov model-based classification approach; multimedia recording; nonspeech acoustic event; support vector machine; video surveillance; Explosions; Feature extraction; Hidden Markov models; Mel frequency cepstral coefficient; Speech; Support vector machines;
Conference_Titel :
Signal Processing Conference, 2011 19th European
Conference_Location :
Barcelona