Title :
Exploring the role of temporal dynamics in acoustic scene classification
Author :
Debmalya Chakrabarty;Mounya Elhilali
Author_Institution :
Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD, USA
Abstract :
Identification of acoustic scenes often relies on finding the most informative features that best characterize the physical nature of sound sources in the scene. In this paper, we propose a framework that provides a detailed local analysis of spectro-temporal modulations augmented with generative modeling that map both the average modulation statistics of the scene using Gaussian Mixture Modeling (GMM) as well temporal trajectories of these modulations using Hidden Markov Modeling (HMM). Our analysis shows that a hybrid system of these two representations can capture the non-trivial commonalities within a sound class and differences between sound classes. The proposed hybrid system outperforms current systems in the literature by about 30 % and surpasses the performance of the individual GMM and HMM systems suggesting that these representations provide complimentary information about acoustic scenes.
Keywords :
"Hidden Markov models","Modulation","Mel frequency cepstral coefficient","Trajectory","Time-frequency analysis","Tensile stress"
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
DOI :
10.1109/WASPAA.2015.7336898