• DocumentCode
    2930402
  • Title

    Multimodal pLSA on visual features and tags

  • Author

    Romberg, Stefan ; Hörster, Eva ; Lienhart, Rainer

  • Author_Institution
    Multimedia Comput. Lab., Univ. of Augsburg, Augsburg, Germany
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    414
  • Lastpage
    417
  • Abstract
    This work studies a new approach for image retrieval on largescale community databases. Our proposed system explores two different modalities: visual features and community-generated metadata, such as tags. We use topic models to derive a high-level representation appropriate for retrieval for each of our images in the database. We evaluate the proposed approach experimentally in a query-by-example retrieval task and compare our results to systems relying solely on visual features or tag features. It is shown that the proposed multimodal system outperforms the unimodal systems by approximately 36%.
  • Keywords
    image representation; image retrieval; meta data; probability; visual databases; community-generated metadata; high-level representation; image retrieval; image tagging; largescale community database; multimodal pLSA; multimodal system; probabilistic latent semantic analysis; query-by-example retrieval task; topic model; visual feature; Feature extraction; Image databases; Image retrieval; Information retrieval; Large-scale systems; Multimedia computing; Spatial databases; Technical Activities Guide -TAG; Visual databases; Vocabulary; SIFT; image retrieval; multimodal pLSA; tags;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202522
  • Filename
    5202522