Title :
Multi-modal classification in digital news libraries
Author :
Chen, Ming-yu ; Hauptmann, Alexander
Author_Institution :
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
Abstract :
This paper describes a comprehensive approach to building robust multimodal video classifiers for a specific digital source, broadcast news. Broadcast news has a very stable structure, and every segment serves a specific purpose. Video classification can support a fundamental understanding of both the structure and the content of the video. The variety of video content makes it hard to classify; however, it also provides multimodal information. Our approach addresses two important issues in multimodal classification: first, selecting a few discriminative features from many raw features, and second, efficiently combining multiple sources. We applied Fisher's Linear Discriminant (FLD) for feature selection and, as the combination strategy, concatenated the projections into a single synthesized feature vector. Experimental results on the 2003 TRECVID news video archive show that our approach achieves very robust and accurate performance.
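Illustrative sketch (not part of the original record): the abstract's two steps, FLD projection per modality followed by concatenation of the projections, might look roughly like the following. This is a minimal sketch under stated assumptions, not the authors' implementation: it assumes a two-class problem, a single FLD direction per modality, and a small ridge term for numerical stability. The function names `fld_direction` and `synthesize` and the synthetic "visual" and "audio" features are hypothetical.

```python
import numpy as np

def fld_direction(X, y, reg=1e-6):
    """Two-class Fisher direction, proportional to Sw^{-1} (m1 - m0)."""
    X0, X1 = X[y == 0], X[y == 1]
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    # Within-class scatter: sum of the two per-class scatter matrices.
    Sw = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    # Small ridge term (an assumption of this sketch) keeps Sw invertible.
    Sw += reg * np.eye(X.shape[1])
    w = np.linalg.solve(Sw, m1 - m0)
    return w / np.linalg.norm(w)

def synthesize(modalities, y):
    """Project each modality onto its own FLD direction, then concatenate."""
    directions = [fld_direction(X, y) for X in modalities]
    Z = np.column_stack([X @ w for X, w in zip(modalities, directions)])
    return Z, directions

# Synthetic stand-ins for two modalities (hypothetical data, not TRECVID).
rng = np.random.default_rng(0)
n = 200
y = np.repeat([0, 1], n // 2)
visual = rng.normal(size=(n, 20)) + 1.5 * y[:, None] * (np.arange(20) < 3)
audio = rng.normal(size=(n, 8)) + 1.0 * y[:, None] * (np.arange(8) < 2)

Z, directions = synthesize([visual, audio], y)
print(Z.shape)  # one projected value per modality for each sample
```

The synthesized matrix `Z` has one column per modality and could then be passed to any downstream classifier; which classifier to use is left open here.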
Keywords :
content management; digital libraries; feature extraction; image classification; image retrieval; information resources; video databases; Fisher Linear Discriminant; TRECVID news video archive; broadcast news; digital news library; digital source; feature selection; feature vector; multimodal video classification; video content; Computer science; Digital video broadcasting; Eigenvalues and eigenfunctions; Image segmentation; Layout; Optimized production technology; Robustness; Software libraries; Support vector machine classification; Support vector machines;
Conference_Title :
Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on
Print_ISBN :
1-58113-832-6
DOI :
10.1109/JCDL.2004.1336122