• DocumentCode
    569136
  • Title

    Efficient Tag Mining via Mixture Modeling for Real-Time Search-Based Image Annotation

  • Author

    Dai, Lican ; Wang, Xin-Jing ; Zhang, Lei ; Yu, Nenghai

  • Author_Institution
    Dept. of EEIS, Univ. of Sci. & Technol. of China, Hefei, China
  • fYear
    2012
  • fDate
    9-13 July 2012
  • Firstpage
    134
  • Lastpage
    139
  • Abstract
    Although it has been extensively studied for many years, automatic image annotation is still a challenging problem. Recently, data-driven approaches have demonstrated their great success to image auto-annotation. Such approaches leverage abundant partially annotated web images to annotate an uncaptioned image. Specifically, they first retrieve a group of visually closely similar images given an uncaptioned image as a query, then figure out meaningful phrases from the surrounding texts of the image search results. Since the surrounding texts are generally noisy, how to effectively mine meaningful phrases is crucial for the success of such approaches. We propose a mixture modeling approach which assumes that a tag is generated from a convex combination of topics. Different from a typical topic modeling approach like LDA, topics in our approach are explicitly learnt from a definitive catalog of the Web, i.e. the Open Directory Project (ODP). Compared with previous works, it has two advantages: Firstly, it uses an open vocabulary rather than a limited one defined by a training set. Secondly, it is efficient for real-time annotation. Experimental results conducted on two billion web images show the efficiency and effectiveness of the proposed approach.
  • Keywords
    data mining; image retrieval; text analysis; vocabulary; ODP; annotated Web images; automatic image annotation; data-driven approaches; image search results; mixture modeling approach; open directory project; open vocabulary; real-time search-based image annotation; surrounding texts; tag mining; uncaptioned image; Indexes; Noise measurement; Real time systems; Semantics; Training; Vectors; Vocabulary; Search based image annotation; Tag mining; Topic space modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo (ICME), 2012 IEEE International Conference on
  • Conference_Location
    Melbourne, VIC
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4673-1659-0
  • Type

    conf

  • DOI
    10.1109/ICME.2012.104
  • Filename
    6298387