Multi-modal classification in digital news libraries

Author

Chen, Ming-yu ; Hauptmann, Alexander

Author_Institution

Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA

fYear

2004

fDate

7-11 June 2004

Firstpage

212

Lastpage

213

Abstract

This paper describes a comprehensive approach to construct robust multimodal video classification on a specific digital source, broadcast news. Broadcast news has a very stable structure and every segment has its specific purpose. Video classification can support fundamental understanding of the structure of the video and the content. The variety of video content makes it hard to classify; however, it also provides multimodal information. Our approach tries to solve two important issues of multimodal classification. The first one is to select few discriminative features from many raw features and the second one is to efficiently combine multiple sources. We applied Fisher´s Linear Discriminant (FLD) for feature selection and concatenated the projections into a single synthesized feature vector as the combination strategy. Experimental results on the 2003 TRECVID news video archive show that our approach achieves very robust and accurate performance.

Keywords

content management; digital libraries; feature extraction; image classification; image retrieval; information resources; video databases; Fisher Linear Discriminant; TRECVID news video archive; broadcast news; digital news library; digital source; feature selection; feature vector; multimodal video classification; video content; Computer science; Digital video broadcasting; Eigenvalues and eigenfunctions; Image segmentation; Layout; Optimized production technology; Robustness; Software libraries; Support vector machine classification; Support vector machines;

fLanguage

English

Publisher

ieee

Conference_Titel

Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on

Print_ISBN

1-58113-832-6

Type

conf

DOI

10.1109/JCDL.2004.1336122

Filename

1336122