Title :
Unsupervised Semantic Feature Discovery for Image Object Retrieval and Tag Refinement
Author :
Kuo, Yin-Hsi ; Cheng, Wen-Huang ; Lin, Hsuan-Tien ; Hsu, Winston H.
Author_Institution :
Grad. Inst. of Networking & Multimedia, Nat. Taiwan Univ., Taipei, Taiwan
Abstract :
We have witnessed the exponential growth of images and videos with the prevalence of capture devices and the ease of social services such as Flickr and Facebook. Meanwhile, enormous media collections are along with rich contextual cues such as tags, geo-locations, descriptions, and time. To obtain desired images, users usually issue a query to a search engine using either an image or keywords. Therefore, the existing solutions for image retrieval rely on either the image contents (e.g., low-level features) or the surrounding texts (e.g., descriptions, tags) only. Those solutions usually suffer from low recall rates because small changes in lighting conditions, viewpoints, occlusions, or (missing) noisy tags can degrade the performance significantly. In this work, we tackle the problem by leveraging both the image contents and associated textual information in the social media to approximate the semantic representations for the two modalities. We propose a general framework to augment each image with relevant semantic (visual and textual) features by using graphs among images. The framework automatically discovers relevant semantic features by propagation and selection in textual and visual image graphs in an unsupervised manner. We investigate the effectiveness of the framework when using different optimization methods for maximizing efficiency. The proposed framework can be directly applied to various applications, such as keyword-based image search, image object retrieval, and tag refinement. Experimental results confirm that the proposed framework effectively improves the performance of these emerging image retrieval applications.
Keywords :
feature extraction; graph theory; image retrieval; image texture; lighting; optimisation; search engines; social networking (online); word processing; automatic relevant semantic feature discovery; efficiency maximization; geo-locations; image object retrieval; image querying; keyword-based image search; lighting conditions; media collections; noisy tags; occlusions; optimization methods; performance degradation; search engine; semantic representations; social media; social services; tag refinement; textual image graphs; textual information; unsupervised semantic feature discovery; viewpoints; visual image graphs; Accuracy; Electronic mail; Image retrieval; Media; Semantics; Visualization; Vocabulary; Image graph; image object retrieval; semantic feature discovery; tag refinement;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2012.2190386