• DocumentCode
    2139425
  • Title
    Towards the detection and the characterization of conversational speech zones in audiovisual documents
  • Author
    Bigot, Benjamin ; Ferrane, Isabelle ; Ibrahim, Zein Al Abidin
  • Author_Institution
    IRIT - Paul Sabatier Univ., Toulouse
  • fYear
    2008
  • fDate
    18-20 June 2008
  • Firstpage
    162
  • Lastpage
    169
  • Abstract
    Giving access to the semantically rich content of large amounts of digital audiovisual data using an automatic and generic method is still an important challenge. The aim of our work is to address this issue while focusing on temporal aspects. Our approach is based on a method previously developed for analyzing temporal relations from a data mining point of view. This method is used to detect zones of a document in which two characteristics are active. These characteristics can result from low-level segmentations of the audio or video components, or from more semantic processings. Once "activity zones" have been detected, we propose to compute a set of additional descriptors in order to better characterize them. The method is applied in the scope of the EPAC project that focuses on the detection and the characterization of conversational speech.
  • Keywords
    audio-visual systems; data mining; document handling; speech recognition; audio component segmentation; audiovisual documents; digital audiovisual data; semantic processings; speech detection; video component segmentation; Aggregates; Content based retrieval; Face detection; Image color analysis; Image segmentation; Indexing; Information retrieval; Speech analysis; Streaming media
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-2043-8
  • Electronic_ISBN
    978-1-4244-2044-5
  • Type
    conf
  • DOI
    10.1109/CBMI.2008.4564942
  • Filename
    4564942