• DocumentCode
    2799771
  • Title

    Automatic state discovery for unstructured audio scene classification

  • Author

    Ramos, Julian ; Siddiqi, Sajid ; Dubrawski, Artur ; Gordon, Geoffrey ; Sharma, Abhishek

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    2154
  • Lastpage
    2157
  • Abstract
    In this paper we present a novel scheme for unstructured audio scene classification that possesses three highly desirable and powerful features: autonomy, scalability, and robustness. Our scheme is based on our recently introduced machine learning algorithm called Simultaneous Temporal And Contextual Splitting (STACS) that discovers the appropriate number of states and efficiently learns accurate Hidden Markov Model (HMM) parameters for the given data. STACS-based algorithms train HMMs up to five times faster than Baum-Welch, avoid the overfitting problem commonly encountered in learning large state-space HMMs using Expectation Maximization (EM) methods such as Baum-Welch, and achieve superior classification results on a very diverse dataset with minimal pre-processing. Furthermore, our scheme has proven to be highly effective for building real-world applications and has been integrated into a commercial surveillance system as an event detection component.
  • Keywords
    audio signal processing; expectation-maximisation algorithm; hidden Markov models; learning (artificial intelligence); surveillance; STACS; automatic state discovery; commercial surveillance system; event detection component; expectation maximization methods; hidden Markov model; machine learning; simultaneous temporal and contextual splitting; unstructured audio scene classification; Data mining; Event detection; Hidden Markov models; Layout; Machine learning algorithms; Robustness; Speech; State-space methods; Surveillance; Topology; HiddenMarkovModels; audio classification; topology learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495605
  • Filename
    5495605