Title :
Integrated image and speech analysis for content-based video indexing
Author :
Chang, Yuh-Lin ; Zeng, Wenjun ; Kamel, Ibrahim ; Alonso, Rafael
Author_Institution :
Matsushita Inf. Technol. Lab., Panasonic Technol. Inc., Princeton, NJ, USA
Abstract :
We study an important problem in multimedia database, namely the automatic extraction of indexing information from raw data based on video contents. The goal of our research project is to develop a prototype system for automatic indexing of sports videos. The novelty of our work is that we propose to integrate speech understanding and image analysis algorithms for extracting information. The main thrust of this work comes from the observation that in news or sports video indexing, usually speech analysis is more efficient in detecting events than image analysis. Therefore, in our system, the audio processing modules are first applied to locate candidates in the whole data. This information is passed to the video processing modules, which further analyze the video. The final products of video analysis are in the form of pointers to the locations of interesting events in a video. Our algorithms have been tested extensively with real TV programs, and results are presented and discussed
Keywords :
audio-visual systems; information retrieval; interactive video; multimedia computing; sport; audio processing modules; automatic extraction; automatic indexing; content based video indexing; image analysis; image analysis algorithms; indexing information; interesting events; multimedia database; raw data; real TV programs; speech analysis; speech understanding; sports videos; video contents; video processing modules; Data mining; Event detection; Image analysis; Information analysis; Machine assisted indexing; Multimedia databases; Prototypes; Speech analysis; TV; Testing;
Conference_Titel :
Multimedia Computing and Systems, 1996., Proceedings of the Third IEEE International Conference on
Conference_Location :
Hiroshima
Print_ISBN :
0-8186-7438-5
DOI :
10.1109/MMCS.1996.534992