DocumentCode :
2819095
Title :
Multimodal learning for multi-label image classification
Author :
Pang, Yanwei ; Ma, Zhao ; Yuan, Yuan ; Li, Xuelong ; Wang, Kongqiao
Author_Institution :
Dept. of Electr. Inf. Eng., Tianjin Univ., Tianjin, China
fYear :
2011
fDate :
11-14 Sept. 2011
Firstpage :
1797
Lastpage :
1800
Abstract :
We tackle the challenge of web image classification using additional tags information. Unlike traditional methods that only use the combination of several low-level features, we try to use semantic concepts to represent images and corresponding tags. At first, we extract the latent topic information by probabilistic latent semantic analysis (pLSA) algorithm, and then use multi-label multiple kernel learning to combine visual and textual features to make a better image classification. In our experiments on PASCAL VOC´07 set and MIR Flickr set, we demonstrate the benefit of using multimodal feature to improve image classification. Specifically, we discover that on the issue of image classification, utilizing latent semantic feature to represent images and associated tags can obtain better classification results than other ways that integrating several low-level features.
Keywords :
image classification; learning (artificial intelligence); probability; Web image classification; latent topic information; multilabel image classification; multilabel multiple kernel learning; multimodal learning; probabilistic latent semantic analysis; semantic concepts; tags information; Classification algorithms; Feature extraction; Image classification; Kernel; Semantics; Support vector machines; Visualization; Multilabel learning; Multimodal features; Multiple kernel learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing (ICIP), 2011 18th IEEE International Conference on
Conference_Location :
Brussels
ISSN :
1522-4880
Print_ISBN :
978-1-4577-1304-0
Electronic_ISBN :
1522-4880
Type :
conf
DOI :
10.1109/ICIP.2011.6115811
Filename :
6115811
Link To Document :
بازگشت