• DocumentCode
    822210
  • Title

    Learning Image-Text Associations

  • Author

    Jiang, Tao ; Tan, Ah-Hwee

  • Author_Institution
    ecPresence Technol. Pte Ltd., Singapore
  • Volume
    21
  • Issue
    2
  • fYear
    2009
  • Firstpage
    161
  • Lastpage
    177
  • Abstract
    Web information fusion can be defined as the problem of collating and tracking information related to specific topics on the World Wide Web. Whereas most existing work on Web information fusion has focused on text-based multidocument summarization, this paper concerns the topic of image and text association, a cornerstone of cross-media Web information fusion. Specifically, we present two learning methods for discovering the underlying associations between images and texts based on small training data sets. The first method based on vague transformation measures the information similarity between the visual features and the textual features through a set of predefined domain-specific information categories. Another method uses a neural network to learn direct mapping between the visual and textual features by automatically and incrementally summarizing the associated features into a set of information templates. Despite their distinct approaches, our experimental results on a terrorist domain document set show that both methods are capable of learning associations between images and texts from a small training data set.
  • Keywords
    Internet; data mining; neural nets; Web information fusion; image-text association; learning method; neural network; textual feature; visual feature; Data mining; Multimedia Information Systems; Multimedia databases; image-text association mining.; multimedia data mining;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2008.150
  • Filename
    4585378