DocumentCode
3153207
Title
Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization
Author
Lin, Kai-Hsiang ; Zhuang, Xiaodan ; Goudeseune, Camille ; King, Sarah ; Hasegawa-Johnson, Mark ; Huang, Thomas S.
Author_Institution
Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
fYear
2012
fDate
25-30 March 2012
Firstpage
2277
Lastpage
2280
Abstract
We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information between the spectrograms of isolated target events and the estimated saliency of the overall visual representation. When subjects are shown spectrograms that are saliency-maximized, they perform significantly better in a 1/10-real-time acoustic event detection task.
Keywords
audio recording; audio signal processing; audio-visual systems; audio recordings; human acoustic event detection; human analyst; realtime acoustic event detection task; rendering; saliency maximized audio spectrogram; saliency maximized audio visualization; salient pattern; visual representation; Acoustics; Audio recording; Event detection; Humans; Spectrogram; Speech; Visualization; acoustic event detection; audio visualization; visual saliency;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6288368
Filename
6288368
Link To Document