DocumentCode
3408342
Title
Harvesting large-scale weakly-tagged image databases from the web
Author
Fan, Jianping ; Shen, Yi ; Zhou, Ning ; Gao, Yuli
Author_Institution
Dept. of Comput. Sci., UNC-Charlotte, Charlotte, NC, USA
fYear
2010
fDate
13-18 June 2010
Firstpage
802
Lastpage
809
Abstract
To leverage large-scale weakly-tagged images for computer vision tasks (such as object detection and scene recognition), a novel cross-modal tag cleansing and junk image filtering algorithm is developed for cleansing the weakly-tagged images and their social tags (i.e., removing irrelevant images and finding the most relevant tags for each image) by integrating both the visual similarity contexts between the images and the semantic similarity contexts between their tags. Our algorithm can address the issues of spams, polysemes and synonyms more effectively and determine the relevance between the images and their social tags more precisely, thus it can allow us to create large amounts of training images with more reliable labels by harvesting from large-scale weakly-tagged images, which can further be used to achieve more effective classifier training for many computer vision tasks.
Keywords
Internet; computer vision; visual databases; Web; computer vision; cross modal tag cleansing; image filtering algorithm; large scale weakly tagged image database; Collaboration; Computer vision; Image databases; Image recognition; Internet; Large scale integration; Large-scale systems; Layout; Object detection; Tagging;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
Conference_Location
San Francisco, CA
ISSN
1063-6919
Print_ISBN
978-1-4244-6984-0
Type
conf
DOI
10.1109/CVPR.2010.5540135
Filename
5540135
Link To Document