Title :
Graph-based multi-modal scene detection for movie and teleplay
Author :
Xu, Su ; Feng, Bailan ; Ding, Peng ; Xu, Bo
Author_Institution :
Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China
Abstract :
Automatic scene detection is a fundamental step for efficient video searching and browsing. This paper presents our current work on scene detection that integrates three effective strategies into a single framework. For each video, firstly, a coherence signal is constructed by graph modal obtained from the similarity matrix in a temporal interval. Secondly, the signal is optimized by scene transition graph (STG) analysis and audio classification, in which scene clues hidden in multimedia are discovered from the video. Finally, the scene boundaries are identified by window function. In experiments, we compare the proposed scene detection method with three typical algorithms on teleplay and movies, and the results of our method, yielding an average 0.85 F-measure, is the best one.
Keywords :
graph theory; image classification; multimedia communication; video signal processing; audio classification; coherence signal; graph-based multimodal scene detection; movie; multimedia; scene boundaries; scene transition graph; similarity matrix; teleplay; video browsing; video searching; window function; Coherence; Feature extraction; Hidden Markov models; Motion pictures; Noise; Silicon; Visualization; STG analysis; audio classify; graph-modal; multi-modal;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288155