• DocumentCode
    1870566
  • Title

    Multimodal summarization of meeting recordings

  • Author

    Erol, Berna ; Lee, Dar-Shyang ; Hull, Jonathan

  • Author_Institution
    Ricoh California Res. Center, Menlo Park, CA, USA
  • Volume
    3
  • fYear
    2003
  • fDate
    6-9 July 2003
  • Abstract
    Recorded meetings are useful only if people can find, access, and browse them easily. Key-frames and video skims are useful representations that enable quick previewing of the content without watching a meeting recording from beginning to end. This paper proposes a new method for creating meeting video skims based on audio and visual activity analysis together with text analysis. Audio activity analysis is performed by analyzing sound directions, which indicate different speakers, and audio amplitude. Detection of important visual events in a meeting is achieved by analyzing localized luminance variations, taking into account the omni-directional property of the video captured by our meeting recording system. Text analysis is based on the term frequency-inverse document frequency measure. The resulting video skims capture the important meeting content better than skims obtained by uniform sampling.
  • Keywords
    audio recording; audio-visual systems; text analysis; video recording; video signal processing; audio activity analysis; meeting recording system; multimodal meeting summarization; term frequency-inverse document frequency measure; text analysis; video skims; visual activity analysis; Audio recording; Event detection; Frequency; Image analysis; Image motion analysis; Image sequence analysis; Performance analysis; Speech analysis; Text analysis; Video recording;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia and Expo, 2003. ICME '03. Proceedings. 2003 International Conference on
  • Print_ISBN
    0-7803-7965-9
  • Type

    conf

  • DOI
    10.1109/ICME.2003.1221239
  • Filename
    1221239