Title :
Latent semantic retrieval of personal photos with sparse user annotation by fused image/speech/text features
Author :
Fu, Yi-Sheng ; Wan, Chia-yu ; Lee, Lin-shan
Author_Institution :
Grad. Inst. of Comput. Sci. & Inf. Eng., Nat. Taiwan Univ., Taipei
Abstract :
While users prefer high-level semantic photo descriptions (e.g., who, what, when, where), we wish to minimize the need for users to annotate photos with such descriptions. We propose a latent semantic personal photo retrieval approach using fused image/speech/text features. We use low-level image features to derive relationships among sparsely annotated photos, and probabilistic latent semantic analysis (PLSA) models based on fused image/speech/text features to analyze photo "topics". We then retrieve the photos using text or speech queries consisting only of simple high-level semantic words. In preliminary experiments, with only 10% of the photos manually annotated, the photos could be retrieved with very encouraging results.
Keywords :
content-based retrieval; image retrieval; text analysis; fused image/speech/text features; image features; latent semantic personal photo retrieval; latent semantic retrieval; personal photos; probabilistic latent semantic analysis models; semantic photo descriptions; sparse user annotation; sparsely annotated photos; speech query; text query; Computer science; Content based retrieval; Digital cameras; Image analysis; Image retrieval; Indexing; Information retrieval; Labeling; Large scale integration; Speech analysis; fused features; image retrieval; latent topics; semantic analysis;
Conference_Title :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-2353-8
ISSN :
1520-6149
DOI :
10.1109/ICASSP.2009.4959997