DocumentCode
3355710
Title
Improving image similarity measures for image browsing and retrieval through latent space learning between images and long texts
Author
Ushiku, Yoshitaka ; Harada, Tatsuya ; Kuniyoshi, Yasuo
Author_Institution
Grad. Sch. of Inf. Sci. & Technol., Univ. of Tokyo, Tokyo, Japan
fYear
2010
fDate
26-29 Sept. 2010
Firstpage
2365
Lastpage
2368
Abstract
The amount of multimedia data on personal devices and the Web is increasing daily. Image browsing and retrieval systems in a low-dimensional space have been widely studied to manage and view large numbers of images. It is essential for such systems to exploit an efficient similarity measure of the images when searching for them. Existing methods use the distance in a low-level image feature space as the similarity measure, and therefore, images with different content may be treated as similar images. In this paper, we propose a novel method to improve the similarity measures for images by considering the text surrounding the images. If there is text describing the images, similarities can be measured more effectively by taking into account the text streams. The proposed method improves the image similarity measures based on the latent semantics obtained from the combination of image and text. It should be noted that the text does not need to be clear tags; indeed, any generic Web text is applicable. Moreover, our method can effectively improve the similarities even if only a small portion of the images include textual descriptions. Additionally, the proposed method is scalable as it has linear computational complexity based on the number of images. In the experiments, we compare our method with previous methods using an original dataset in which a portion of the images are annotated by long text. We show that the proposed method can retrieve semantically similar images more precisely than existing methods.
Keywords
Internet; computational complexity; image retrieval; multimedia computing; World Wide Web; computational complexity; image browsing; image retrieval; image similarity measures; latent space learning; low-dimensional space; low-level image feature space; multimedia data; personal devices; retrieval systems; text streams; Accuracy; Correlation; Feature extraction; Image retrieval; Layout; Principal component analysis; Semantics; Similarity measure; image retrieval and browsing; multimodal learning; pCCA; semantic gap;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2010 17th IEEE International Conference on
Conference_Location
Hong Kong
ISSN
1522-4880
Print_ISBN
978-1-4244-7992-4
Electronic_ISBN
1522-4880
Type
conf
DOI
10.1109/ICIP.2010.5652827
Filename
5652827
Link To Document