DocumentCode
2139425
Title
Towards the detection and the characterization of conversational speech zones in audiovisual documents
Author
Bigot, Benjamin ; Ferrane, Isabelle ; Ibrahim, Zein Al Abidin
Author_Institution
IRIT - Paul Sabatier Univ., Toulouse
fYear
2008
fDate
18-20 June 2008
Firstpage
162
Lastpage
169
Abstract
Giving access to the semantically rich content of large amounts of digital audiovisual data using an automatic and generic method is still an important challenge. The aim of our work is to address this issue while focusing on temporal aspects. Our approach is based on a method previously developed for analyzing temporal relations from a data mining point of view. This method is used to detect zones of a document in which two characteristics are active. These characteristics can result from low-level segmentations of the audio or video components, or from more semantic processings. Once "activity zones" have been detected, we propose to compute a set of additional descriptors in order to better characterize them. The method is applied in the scope of the EPAC project that focuses on the detection and the characterization of conversational speech.
Keywords
audio-visual systems; data mining; document handling; speech recognition; audio component segmentation; audiovisual documents; data mining; digital audiovisual data; semantic processings; speech detection; video component segmentation; Aggregates; Content based retrieval; Data mining; Face detection; Image color analysis; Image segmentation; Indexing; Information retrieval; Speech analysis; Streaming media;
fLanguage
English
Publisher
IEEE
Conference_Titel
Content-Based Multimedia Indexing, 2008. CBMI 2008. International Workshop on
Conference_Location
London
Print_ISBN
978-1-4244-2043-8
Electronic_ISBN
978-1-4244-2044-5
Type
conf
DOI
10.1109/CBMI.2008.4564942
Filename
4564942