• DocumentCode
    594700
  • Title

    Cross-modal topic correlations for multimedia retrieval

  • Author

    Jing Yu ; Yonghui Cong ; Zengchang Qin ; Tao Wan

  • Author_Institution
    Intell. Comput. & Machine Learning Lab., Beihang Univ., Beijing, China
  • fYear
    2012
  • fDate
    11-15 Nov. 2012
  • Firstpage
    246
  • Lastpage
    249
  • Abstract
    In this paper, we propose a novel approach for cross-modal multimedia retrieval by jointly modeling the text and image components of multimedia documents. In this model, the image component is represented by local SIFT descriptors based on the bag-of-feature model. The text component is represented by a topic distribution learned from latent topic models such as latent Dirichlet allocation (LDA). The latent semantic relations between texts and images can be reflected by correlations between the word topics and topics of image features. A statistical correlation model conditioned on category information is investigated. Experimental results on a benchmark Wikipedia dataset show that the newly proposed approach outperforms state-of-the-art cross-modal multimedia retrieval systems.
  • Keywords
    image processing; information retrieval; multimedia systems; statistical analysis; text analysis; LDA; Wikipedia dataset; bag-of-feature model; cross-modal topic correlations; image components; latent Dirichlet allocation; local SIFT descriptors; multimedia documents; multimedia retrieval; statistical correlation; text components; Correlation; Electronic publishing; Encyclopedias; Internet; Multimedia communication; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2012 21st International Conference on
  • Conference_Location
    Tsukuba
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4673-2216-4
  • Type

    conf

  • Filename
    6460118