Title :
Structuring lecture videos for distance learning applications
Author :
Ngo, Chong-Wah ; Wang, Feng ; Pong, Ting-Chuen
Author_Institution :
Dept. of Comput. Sci., City Univ. of Hong Kong, Kowloon, China
Abstract :
We present an automatic and novel approach in structuring and indexing lecture videos for distance learning applications. By structuring video content, we can support both topic indexing and semantic querying of multimedia documents. our aim is to link the discussion topics extracted from the electronic slides with their associated video and audio segments. Two major techniques in our proposed approach include video text analysis and speech recognition. Initially, a video is partitioned into shots based on slide transitions. For each shot, the embedded video texts are detected, reconstructed and segmented as high-resolution foreground texts for commercial OCR recognition. The recognized texts can then be matched with their associated slides for video indexing. Meanwhile, both phrases (title) and keywords (content) are also extracted from the electronic slides to spot the speech signals. The spotted phrases and keywords are further utilized as queries to retrieve the most similar slide for speech indexing.
Keywords :
distance learning; educational technology; image recognition; multimedia computing; optical character recognition; speech recognition; video signal processing; OCR recognition; distance learning applications; lecture video structuring; multimedia documents; semantic querying; speech indexing; speech recognition; topic indexing; video indexing; video text analysis; Application software; Computer aided instruction; Computer science; Gunshot detection systems; Indexing; Joining processes; Streaming media; Text analysis; Text recognition; Videos;
Conference_Titel :
Multimedia Software Engineering, 2003. Proceedings. Fifth International Symposium on
Conference_Location :
Taichung, Taiwan
Print_ISBN :
0-7695-2031-6
DOI :
10.1109/MMSE.2003.1254444