Title :
Tag-Based Image Retrieval Improved by Augmented Features and Group-Based Refinement
Author :
Chen, Lin ; Xu, Dong ; Tsang, Ivor W. ; Luo, Jiebo
Author_Institution :
Centre for Multimedia & Network Technol. (CeMNet), Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we propose a new tag-based image retrieval framework to improve the retrieval performance of a group of related personal images captured by the same user within a short period of an event by leveraging millions of training web images and their associated rich textual descriptions. For any given query tag (e.g., “car”), the inverted file method is employed to automatically determine the relevant training web images that are associated with the query tag and the irrelevant training web images that are not associated with the query tag. Using these relevant and irrelevant web images as positive and negative training data respectively, we propose a new classification method called support vector machine (SVM) with augmented features (AFSVM) to learn an adapted classifier by leveraging the prelearned SVM classifiers of popular tags that are associated with a large number of relevant training web images. Treating the decision values of one group of test photos from AFSVM classifiers as the initial relevance scores, in the subsequent group-based refinement process, we propose to use the Laplacian regularized least squares method to further refine the relevance scores of test photos by utilizing the visual similarity of the images within the group. Based on the refined relevance scores, our proposed framework can be readily applied to tag-based image retrieval for a group of raw consumer photos without any textual descriptions or a group of Flickr photos with noisy tags. Moreover, we propose a new method to better calculate the relevance scores for Flickr photos. Extensive experiments on two datasets demonstrate the effectiveness of our framework.
Keywords :
Internet; feature extraction; image classification; image retrieval; learning (artificial intelligence); least squares approximations; social networking (online); support vector machines; text analysis; AFSVM classifiers; Flickr photos; Laplacian regularized least squares method; adapted classifier learning; augmented features; classification method; datasets; decision values; group-based refinement process; inverted file method; irrelevant training Web images; negative training data; noisy tags; personal images; positive training data; query tag; raw consumer photos; relevance score refinement; relevant training Web images; rich textual descriptions; support vector machine with augmented features; tag-based image retrieval performance improvement; test photos; visual similarity; Image retrieval; Noise measurement; Semantics; Snow; Support vector machines; Training; Training data; Group-based refinement; Laplacian regularized least squares (LapRLS); support vector machine (SVM) with augmented features (AFSVM); tag-based image retrieval;
Journal_Title :
Multimedia, IEEE Transactions on
DOI :
10.1109/TMM.2012.2187435