DocumentCode
2164233
Title
Continuous audio analytics by HMM and Viterbi decoding
Author
Ramasubramanian, V. ; Karthik, R. ; Thiyagarajan, S. ; Cherla, Srikanth
Author_Institution
Siemens Corp. Res. & Technol.-India, Bangalore, India
fYear
2011
fDate
22-27 May 2011
Firstpage
2396
Lastpage
2399
Abstract
We address the problem of audio analytics with respect to efficient modeling of audio classes and continuous decoding of audio stream to automatically segment and label the audio stream as required in audio indexing. We propose the use of left-to-right HMMs and ergodic HMMs to respectively model definite and indefinite duration audio classes and Viterbi decoding using these HMMs with non-emitting states for continuous decoding of audio streams. We quantify the decoding performance using detection and false-alarm rates and show that the proposed HMM based modeling and Viterbi decoding can have high decoding accuracies with average (%Hit, %False-alarm) of (79.2%, 1.6%), which are significantly better than VQ, GMM and Template based decoding, indicating the viability of the proposed modeling and decoding technique for practical surveillance audio analytics.
Keywords
Viterbi decoding; audio coding; audio streaming; hidden Markov models; HMM based modeling; Viterbi decoding; audio indexing; audio stream continuous decoding; continuous audio analytics; false-alarm rates; hidden Markov model; template based decoding; Decoding; Hidden Markov models; Indexing; Labeling; Training; Training data; Viterbi algorithm; Audio analytics; Viterbi decoding; audio segmentation and labeling; ergodic HMM; left-to-right HMM;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location
Prague
ISSN
1520-6149
Print_ISBN
978-1-4577-0538-0
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2011.5946966
Filename
5946966
Link To Document