Title :
Multi-Modal Video Concept Extraction Using Co-Training
Author :
Yan, Rong ; Naphade, Milind
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA
Abstract :
For large scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. A major obstacle to this is the insufficiency of labeled training samples. Semi-supervised learning algorithms such as co-training may help by incorporating a large amount of unlabeled data, which allows the redundant information across views to improve the learning performance. Although co-training has been successfully applied in several domains, it has not been used to detect video concepts before. In this paper, we extend co-training to the domain of video concept detection and investigate different strategies of co-training as well as their effects to the detection accuracy. We demonstrate performance based on the guideline of the TRECVID ´03 semantic concept extraction task
Keywords :
feature extraction; image classification; learning (artificial intelligence); semantic networks; video signal processing; TRECVID ´03 semantic concept; cotraining algorithm; multimodal video concept extraction; redundant information; semisupervised learning algorithm; Computer science; Drives; Government; Guidelines; Gunshot detection systems; Large-scale systems; Semisupervised learning; Speech; Statistical learning; Streaming media;
Conference_Titel :
Multimedia and Expo, 2005. ICME 2005. IEEE International Conference on
Conference_Location :
Amsterdam
Print_ISBN :
0-7803-9331-7
DOI :
10.1109/ICME.2005.1521473