DocumentCode
2799771
Title
Automatic state discovery for unstructured audio scene classification
Author
Ramos, Julian ; Siddiqi, Sajid ; Dubrawski, Artur ; Gordon, Geoffrey ; Sharma, Abhishek
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2010
fDate
14-19 March 2010
Firstpage
2154
Lastpage
2157
Abstract
In this paper we present a novel scheme for unstructured audio scene classification that possesses three highly desirable and powerful features: autonomy, scalability, and robustness. Our scheme is based on our recently introduced machine learning algorithm called Simultaneous Temporal And Contextual Splitting (STACS) that discovers the appropriate number of states and efficiently learns accurate Hidden Markov Model (HMM) parameters for the given data. STACS-based algorithms train HMMs up to five times faster than Baum-Welch, avoid the overfitting problem commonly encountered in learning large state-space HMMs using Expectation Maximization (EM) methods such as Baum-Welch, and achieve superior classification results on a very diverse dataset with minimal pre-processing. Furthermore, our scheme has proven to be highly effective for building real-world applications and has been integrated into a commercial surveillance system as an event detection component.
Keywords
audio signal processing; expectation-maximisation algorithm; hidden Markov models; learning (artificial intelligence); surveillance; STACS; automatic state discovery; commercial surveillance system; event detection component; expectation maximization methods; hidden Markov model; machine learning; simultaneous temporal and contextual splitting; unstructured audio scene classification; Data mining; Event detection; Hidden Markov models; Layout; Machine learning algorithms; Robustness; Speech; State-space methods; Surveillance; Topology; HiddenMarkovModels; audio classification; topology learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495605
Filename
5495605
Link To Document