• DocumentCode
    3716246
  • Title
    Audio salient event detection and summarization using audio and text modalities
  • Author
    Athanasia Zlatintsi;Elias Iosif;Petros Maragos;Alexandros Potamianos
  • Author_Institution
    School of ECE, National Technical University of Athens, Greece
  • fYear
    2015
  • Firstpage
    2311
  • Lastpage
    2315
  • Abstract
    This paper investigates the problem of audio event detection and summarization, building on previous work [1,2] on the detection of perceptually important audio events based on saliency models. We take a synergistic approach to audio summarization in which the saliency computation of audio streams is assisted by the text modality. Auditory saliency is assessed by auditory and perceptual cues such as Teager energy, loudness, and roughness, all known to correlate with attention and human hearing. Text analysis incorporates part-of-speech tagging and affective modeling. A computational method for automatically correcting the boundaries of the selected audio events is applied, creating summaries that consist of events that are not only salient but also meaningful and semantically coherent. A non-parametric classification technique is employed, and results are reported on the MovSum movie database using objective evaluations against ground truth that designates the auditory and semantically salient events.
  • Keywords
    "Motion pictures","Feature extraction","Semantics","Event detection","Text analysis","Databases","Speech"
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Conference (EUSIPCO), 2015 23rd European
  • Electronic_ISBN
    2076-1465
  • Type
    conf
  • DOI
    10.1109/EUSIPCO.2015.7362797
  • Filename
    7362797