• DocumentCode
    2987631
  • Title
    Supervised models for multimodal image retrieval based on visual, semantic and geographic information
  • Author
    Dang-Nguyen, Duc-Tien ; Boato, Giulia ; Moschitti, Alessandro ; De Natale, Francesco G B
  • Author_Institution
    Dept. of Inf. & Comput. Sci., Univ. of Trento, Trento, Italy
  • fYear
    2012
  • fDate
    27-29 June 2012
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Nowadays, large-scale networked social media need better search technologies to achieve suitable performance. Multimodal approaches are promising technologies to improve image ranking. This is particularly true when metadata are not completely reliable, which is a rather common case as far as user annotation, time and location are concerned. In this paper, we propose to properly combine visual information with additional multi-faceted information, to define a novel multimodal similarity measure. More specifically, we combine visual features, which strongly relate to the image content, with semantic information represented by manually annotated concepts, and geo tagging, very often available in the form of object/subject location. Furthermore, we propose a supervised machine learning approach, based on Support Vector Machines (SVMs), to automatically learn optimized weights to combine the above features. The resulting model is used as a ranking function to sort the results of a multimodal query.
  • Keywords
    image retrieval; learning (artificial intelligence); social networking (online); support vector machines; SVM; geo tagging; geographic information; image content; image ranking; large-scale networked social media; manually annotated concepts; meta data; multifaceted information; multimodal image retrieval; multimodal query; multimodal similarity measure; ranking function; search technologies; semantic information; supervised machine learning approach; supervised models; support vector machines; user annotation; visual information; Accuracy; Global Positioning System; Image retrieval; Reliability; Semantics; Visualization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Content-Based Multimedia Indexing (CBMI), 2012 10th International Workshop on
  • Conference_Location
    Annecy
  • ISSN
    1949-3983
  • Print_ISBN
    978-1-4673-2368-0
  • Electronic_ISBN
    1949-3983
  • Type
    conf
  • DOI
    10.1109/CBMI.2012.6269806
  • Filename
    6269806
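
The approach summarized in the Abstract (per-modality similarities combined with SVM-learned weights, then used as a ranking function) can be illustrated with a minimal sketch. Everything specific here is an assumption, not taken from the paper: cosine similarity for visual features, Jaccard overlap for annotated concepts, an exponential distance decay for geo tags, a linear SVM trained by plain subgradient descent on the hinge loss, and all function names and toy values, which are invented for illustration only.

```python
import math

def cosine(u, v):
    """Cosine similarity between two visual feature vectors (assumed measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def jaccard(a, b):
    """Overlap between two sets of annotated concepts (assumed measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def geo_similarity(p, q, scale_km=100.0):
    """Map haversine distance between (lat, lon) pairs to (0, 1]; scale_km is a guess."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*p, *q))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    dist_km = 2 * 6371.0 * math.asin(math.sqrt(h))
    return math.exp(-dist_km / scale_km)

def similarity_vector(query, image):
    """One similarity per modality: [visual, semantic, geographic]."""
    return [cosine(query["visual"], image["visual"]),
            jaccard(query["concepts"], image["concepts"]),
            geo_similarity(query["geo"], image["geo"])]

def score(w, b, x):
    """Weighted combination of the per-modality similarities."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b

def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Linear SVM via per-sample subgradient descent on the L2-regularized hinge loss."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            if yi * score(w, b, xi) < 1:  # margin violated: hinge term is active
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:                         # only the regularizer contributes
                w = [wj - lr * lam * wj for wj in w]
    return w, b

def rank(query, candidates, w, b):
    """Sort candidate images by descending learned multimodal score."""
    return sorted(candidates,
                  key=lambda c: score(w, b, similarity_vector(query, c)),
                  reverse=True)

# Toy training set (invented): similarity vectors labeled relevant (+1) or not (-1).
TRAIN_X = [(0.9, 0.8, 0.9), (0.8, 0.6, 0.7), (0.2, 0.1, 0.3), (0.1, 0.3, 0.0)]
TRAIN_Y = [1, 1, -1, -1]
w, b = train_linear_svm(TRAIN_X, TRAIN_Y)

# Toy multimodal query and two candidate images (invented values).
query = {"visual": [1.0, 0.0], "concepts": {"lake", "mountain"}, "geo": (46.07, 11.12)}
candidates = [
    {"id": "B", "visual": [0.1, 0.9], "concepts": {"city"}, "geo": (41.90, 12.50)},
    {"id": "A", "visual": [0.9, 0.1], "concepts": {"lake", "mountain", "snow"},
     "geo": (46.10, 11.10)},
]
ranked = rank(query, candidates, w, b)
```

In this sketch the learned vector `w` plays the role of the optimized combination weights, and `rank` applies the resulting model to the results of a multimodal query. A library SVM could replace the hand-written trainer; the stdlib version is used only to keep the example self-contained.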