• DocumentCode
    2395862
  • Title

    Coherent image annotation by learning semantic distance

  • Author

    Mei, Tao ; Wang, Yong ; Hua, Xian-Sheng ; Gong, Shaogang ; Li, Shipeng

  • Author_Institution
    Microsoft Res. Asia, Beijing
  • fYear
    2008
  • fDate
    23-28 June 2008
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Conventional approaches to automatic image annotation usually suffer from two problems: (1) They cannot guarantee a good semantic coherence of the annotated words for each image, as they treat each word independently without considering the inherent semantic coherence among the words; (2) They heavily rely on visual similarity for judging semantic similarity. To address the above issues, we propose a novel approach to image annotation which simultaneously learns a semantic distance by capturing the prior annotation knowledge and propagates the annotation of an image as a whole entity. Specifically, a semantic distance function (SDF) is learned for each semantic cluster to measure the semantic similarity based on relative comparison relations of prior annotations. To annotate a new image, the training images in each cluster are ranked according to their SDF values with respect to this image and their corresponding annotations are then propagated to this image as a whole entity to ensure semantic coherence. We evaluate the innovative SDF-based approach on Corel images compared with Support Vector Machine-based approach. The experiments show that SDF-based approach outperforms in terms of semantic coherence, especially when each training image is associated with multiple words.
  • Keywords
    image processing; support vector machines; annotation knowledge; coherent image annotation; learning semantic distance; semantic coherence; semantic distance function; support vector machine; Asia; Computer science; Degradation; Digital cameras; Digital images; Image retrieval; Sun; Training data; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on
  • Conference_Location
    Anchorage, AK
  • ISSN
    1063-6919
  • Print_ISBN
    978-1-4244-2242-5
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2008.4587386
  • Filename
    4587386