• DocumentCode
    3408342
  • Title

    Harvesting large-scale weakly-tagged image databases from the web

  • Author

    Fan, Jianping ; Shen, Yi ; Zhou, Ning ; Gao, Yuli

  • Author_Institution
    Dept. of Comput. Sci., UNC-Charlotte, Charlotte, NC, USA
  • fYear
    2010
  • fDate
    13-18 June 2010
  • Firstpage
    802
  • Lastpage
    809
  • Abstract
    To leverage large-scale weakly-tagged images for computer vision tasks (such as object detection and scene recognition), a novel cross-modal tag cleansing and junk image filtering algorithm is developed for cleansing the weakly-tagged images and their social tags (i.e., removing irrelevant images and finding the most relevant tags for each image) by integrating both the visual similarity contexts between the images and the semantic similarity contexts between their tags. Our algorithm can address the issues of spams, polysemes and synonyms more effectively and determine the relevance between the images and their social tags more precisely, thus it can allow us to create large amounts of training images with more reliable labels by harvesting from large-scale weakly-tagged images, which can further be used to achieve more effective classifier training for many computer vision tasks.
  • Keywords
    Internet; computer vision; visual databases; Web; computer vision; cross modal tag cleansing; image filtering algorithm; large scale weakly tagged image database; Collaboration; Computer vision; Image databases; Image recognition; Internet; Large scale integration; Large-scale systems; Layout; Object detection; Tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-6984-0
  • Type

    conf

  • DOI
    10.1109/CVPR.2010.5540135
  • Filename
    5540135