• DocumentCode
    3153207
  • Title

    Improving faster-than-real-time human acoustic event detection by saliency-maximized audio visualization

  • Author

    Lin, Kai-Hsiang ; Zhuang, Xiaodan ; Goudeseune, Camille ; King, Sarah ; Hasegawa-Johnson, Mark ; Huang, Thomas S.

  • Author_Institution
    Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • fYear
    2012
  • fDate
    25-30 March 2012
  • Firstpage
    2277
  • Lastpage
    2280
  • Abstract
    We propose a saliency-maximized audio spectrogram as a representation that lets human analysts quickly search for and detect events in audio recordings. By rendering target events as visually salient patterns, this representation minimizes the time and effort needed to examine a recording. In particular, we propose a transformation of a conventional spectrogram that maximizes the mutual information between the spectrograms of isolated target events and the estimated saliency of the overall visual representation. When subjects are shown spectrograms that are saliency-maximized, they perform significantly better in a 1/10-real-time acoustic event detection task.
  • Keywords
    audio recording; audio signal processing; audio-visual systems; audio recordings; human acoustic event detection; human analyst; realtime acoustic event detection task; rendering; saliency maximized audio spectrogram; saliency maximized audio visualization; salient pattern; visual representation; Acoustics; Audio recording; Event detection; Humans; Spectrogram; Speech; Visualization; acoustic event detection; audio visualization; visual saliency;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
  • Conference_Location
    Kyoto
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4673-0045-2
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2012.6288368
  • Filename
    6288368