• DocumentCode
    2930188
  • Title

    Learning image semantics with latent aspect model

  • Author

    Li, Zhixin ; Liu, Xi ; Shi, Zhiping ; Shi, Zhongzhi

  • Author_Institution
    Key Lab. of Intell. Inf. Process., Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    June 28 2009-July 3 2009
  • Firstpage
    366
  • Lastpage
    369
  • Abstract
    Automatic image annotation has become an important and challenging problem due to the existence of semantic gap. In this paper, we present an approach based on probabilistic latent semantic analysis (PLSA) to accomplish the tasks of semantic image annotation and retrieval. In order to model training images precisely, we employ two PLSA models to capture semantic information from visual and textual modalities respectively. Then an adaptive asymmetric learning approach is proposed to fuse aspects which are learned from both modalities. For each image document, the weight of each modality is determined by its contribution to the content of the image. Consequently, the two models are linked with the same distribution over aspects. This structure can predict semantic annotation for an unseen image because it associates visual and textual modalities properly. Finally, we compare our approach with several previous approaches on a standard Corel dataset. The experiment results show that our approach performs more effective and accurate.
  • Keywords
    image retrieval; information analysis; learning (artificial intelligence); asymmetric learning approach; automatic image annotation; image document; image retrieval; image semantic learning; latent aspect model; probabilistic latent semantic analysis; semantic image annotation; standard Corel dataset; textual modalities; visua modalities; Feature extraction; Fuses; Hidden Markov models; Image databases; Image retrieval; Indexing; Information processing; Laboratories; Spatial databases; Visual databases; PLSA; adaptive asymmetric learning; aspect model; automatic image annotation; image retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia and Expo, 2009. ICME 2009. IEEE International Conference on
  • Conference_Location
    New York, NY
  • ISSN
    1945-7871
  • Print_ISBN
    978-1-4244-4290-4
  • Electronic_ISBN
    1945-7871
  • Type

    conf

  • DOI
    10.1109/ICME.2009.5202510
  • Filename
    5202510