DocumentCode :
2164233
Title :
Continuous audio analytics by HMM and Viterbi decoding
Author :
Ramasubramanian, V. ; Karthik, R. ; Thiyagarajan, S. ; Cherla, Srikanth
Author_Institution :
Siemens Corp. Res. & Technol.-India, Bangalore, India
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
2396
Lastpage :
2399
Abstract :
We address the problem of audio analytics with respect to efficient modeling of audio classes and continuous decoding of audio stream to automatically segment and label the audio stream as required in audio indexing. We propose the use of left-to-right HMMs and ergodic HMMs to respectively model definite and indefinite duration audio classes and Viterbi decoding using these HMMs with non-emitting states for continuous decoding of audio streams. We quantify the decoding performance using detection and false-alarm rates and show that the proposed HMM based modeling and Viterbi decoding can have high decoding accuracies with average (%Hit, %False-alarm) of (79.2%, 1.6%), which are significantly better than VQ, GMM and Template based decoding, indicating the viability of the proposed modeling and decoding technique for practical surveillance audio analytics.
Keywords :
Viterbi decoding; audio coding; audio streaming; hidden Markov models; HMM based modeling; Viterbi decoding; audio indexing; audio stream continuous decoding; continuous audio analytics; false-alarm rates; hidden Markov model; template based decoding; Decoding; Hidden Markov models; Indexing; Labeling; Training; Training data; Viterbi algorithm; Audio analytics; Viterbi decoding; audio segmentation and labeling; ergodic HMM; left-to-right HMM;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946966
Filename :
5946966
Link To Document :
بازگشت