DocumentCode
420215
Title
Multi-modal classification in digital news libraries
Author
Chen, Ming-yu ; Hauptmann, Alexander
Author_Institution
Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2004
fDate
7-11 June 2004
Firstpage
212
Lastpage
213
Abstract
This paper describes a comprehensive approach to construct robust multimodal video classification on a specific digital source, broadcast news. Broadcast news has a very stable structure and every segment has its specific purpose. Video classification can support fundamental understanding of the structure of the video and the content. The variety of video content makes it hard to classify; however, it also provides multimodal information. Our approach tries to solve two important issues of multimodal classification. The first one is to select few discriminative features from many raw features and the second one is to efficiently combine multiple sources. We applied Fisher´s Linear Discriminant (FLD) for feature selection and concatenated the projections into a single synthesized feature vector as the combination strategy. Experimental results on the 2003 TRECVID news video archive show that our approach achieves very robust and accurate performance.
Keywords
content management; digital libraries; feature extraction; image classification; image retrieval; information resources; video databases; Fisher Linear Discriminant; TRECVID news video archive; broadcast news; digital news library; digital source; feature selection; feature vector; multimodal video classification; video content; Computer science; Digital video broadcasting; Eigenvalues and eigenfunctions; Image segmentation; Layout; Optimized production technology; Robustness; Software libraries; Support vector machine classification; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on
Print_ISBN
1-58113-832-6
Type
conf
DOI
10.1109/JCDL.2004.1336122
Filename
1336122
Link To Document