DocumentCode
3518683
Title
Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features
Author
Fu, Yi-Sheng ; Wan, Chia-yu ; Lee, Lin-shan
Author_Institution
Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
fYear
2009
fDate
19-24 April 2009
Firstpage
1969
Lastpage
1972
Abstract
While users prefer high-level semantic photo descriptions (e.g., who, what, when, where), we wish to minimize the need to annotate photos using such descriptions by the user. We propose a latent semantic personal photo retrieval approach using fused image/speech/text features. We use low-level image features to derive relationships among sparsely annotated photos, and probabilistic latent semantic analysis (PLSA) models based on fused image/speech/text features to analyze photo ldquotopicsrdquo. We then retrieve the photos using text or speech queries of simple high-level semantic words only. In preliminary experiments, while only 10% of the photos were manually annotated, the photos could be well retrieved with very encouraging results.
Keywords
content-based retrieval; image retrieval; text analysis; fused image/speech/text features; image features; latent semantic personal photo retrieval; latent semantic retrieval; personal photos; probabilistic latent semantic analysis models; semantic photo descriptions; sparse user annotation; sparsely annotated photos; speech query; text query; Computer science; Content based retrieval; Digital cameras; Image analysis; Image retrieval; Indexing; Information retrieval; Labeling; Large scale integration; Speech analysis; fused features; image retrieval; latent topics; semantic analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location
Taipei
ISSN
1520-6149
Print_ISBN
978-1-4244-2353-8
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2009.4959997
Filename
4959997
Link To Document