Continuous audio analytics by HMM and Viterbi decoding

Author

Ramasubramanian, V. ; Karthik, R. ; Thiyagarajan, S. ; Cherla, Srikanth

Author_Institution

Siemens Corp. Res. & Technol.-India, Bangalore, India

fYear

2011

fDate

22-27 May 2011

Firstpage

2396

Lastpage

2399

Abstract

We address the problem of audio analytics with respect to efficient modeling of audio classes and continuous decoding of audio stream to automatically segment and label the audio stream as required in audio indexing. We propose the use of left-to-right HMMs and ergodic HMMs to respectively model definite and indefinite duration audio classes and Viterbi decoding using these HMMs with non-emitting states for continuous decoding of audio streams. We quantify the decoding performance using detection and false-alarm rates and show that the proposed HMM based modeling and Viterbi decoding can have high decoding accuracies with average (%Hit, %False-alarm) of (79.2%, 1.6%), which are significantly better than VQ, GMM and Template based decoding, indicating the viability of the proposed modeling and decoding technique for practical surveillance audio analytics.

Keywords

Viterbi decoding; audio coding; audio streaming; hidden Markov models; HMM based modeling; Viterbi decoding; audio indexing; audio stream continuous decoding; continuous audio analytics; false-alarm rates; hidden Markov model; template based decoding; Decoding; Hidden Markov models; Indexing; Labeling; Training; Training data; Viterbi algorithm; Audio analytics; Viterbi decoding; audio segmentation and labeling; ergodic HMM; left-to-right HMM;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location

Prague

ISSN

1520-6149

Print_ISBN

978-1-4577-0538-0

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2011.5946966

Filename

5946966