DocumentCode :
1299639
Title :
Textual Query of Personal Photos Facilitated by Large-Scale Web Data
Author :
Liu, Yiming ; Xu, Dong ; Tsang, Ivor Wai-Hung ; Luo, Jiebo
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume :
33
Issue :
5
fYear :
2011
fDate :
5/1/2011 12:00:00 AM
Firstpage :
1022
Lastpage :
1036
Abstract :
The rapid popularization of digital cameras and mobile phone cameras has led to an explosive growth of personal photo collections by consumers. In this paper, we present a real-time textual query-based personal photo retrieval system by leveraging millions of Web images and their associated rich textual descriptions (captions, categories, etc.). After a user provides a textual query (e.g., "water”), our system exploits the inverted file to automatically find the positive Web images that are related to the textual query "water” as well as the negative Web images that are irrelevant to the textual query. Based on these automatically retrieved relevant and irrelevant Web images, we employ three simple but effective classification methods, k-Nearest Neighbor (kNN), decision stumps, and linear SVM, to rank personal photos. To further improve the photo retrieval performance, we propose two relevance feedback methods via cross-domain learning, which effectively utilize both the Web images and personal images. In particular, our proposed cross-domain learning methods can learn robust classifiers with only a very limited amount of labeled personal photos from the user by leveraging the prelearned linear SVM classifiers in real time. We further propose an incremental cross-domain learning method in order to significantly accelerate the relevance feedback process on large consumer photo databases. Extensive experiments on two consumer photo data sets demonstrate the effectiveness and efficiency of our system, which is also inherently not limited by any predefined lexicon.
Keywords :
Internet; image classification; image retrieval; support vector machines; text analysis; Web images; cross-domain learning; decision stumps; digital cameras; image classification method; k-nearest neighbor method; linear SVM; mobile phone cameras; personal photo ranking; real-time textual query based personal photo retrieval system; relevance feedback method; robust classifiers; Image retrieval; Learning systems; Real time systems; Robustness; Semantics; Support vector machines; Training; Textual query-based consumer photo retrieval; cross-domain learning.; large-scale Web data;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2010.142
Filename :
5551148
Link To Document :
بازگشت