• DocumentCode
    3518683
  • Title

    Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features

  • Author

    Fu, Yi-Sheng ; Wan, Chia-yu ; Lee, Lin-shan

  • Author_Institution
    Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    1969
  • Lastpage
    1972
  • Abstract
    While users prefer high-level semantic photo descriptions (e.g., who, what, when, where), we wish to minimize the need to annotate photos using such descriptions by the user. We propose a latent semantic personal photo retrieval approach using fused image/speech/text features. We use low-level image features to derive relationships among sparsely annotated photos, and probabilistic latent semantic analysis (PLSA) models based on fused image/speech/text features to analyze photo ldquotopicsrdquo. We then retrieve the photos using text or speech queries of simple high-level semantic words only. In preliminary experiments, while only 10% of the photos were manually annotated, the photos could be well retrieved with very encouraging results.
  • Keywords
    content-based retrieval; image retrieval; text analysis; fused image/speech/text features; image features; latent semantic personal photo retrieval; latent semantic retrieval; personal photos; probabilistic latent semantic analysis models; semantic photo descriptions; sparse user annotation; sparsely annotated photos; speech query; text query; Computer science; Content based retrieval; Digital cameras; Image analysis; Image retrieval; Indexing; Information retrieval; Labeling; Large scale integration; Speech analysis; fused features; image retrieval; latent topics; semantic analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959997
  • Filename
    4959997