Improving Web Image Search by Bag-Based Reranking

Author

Duan, Lixin ; Li, Wen ; Tsang, Ivor Wai-Hung ; Xu, Dong

Author_Institution

Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore

Volume

20

Issue

11

fYear

2011

Firstpage

3280

Lastpage

3290

Abstract

Given a textual query in traditional text-based image retrieval (TBIR), relevant images are to be reranked using visual features after the initial text-based search. In this paper, we propose a new bag-based reranking framework for large-scale TBIR. Specifically, we first cluster relevant images using both textual and visual features. By treating each cluster as a “bag” and the images in the bag as “instances,” we formulate this problem as a multi-instance (MI) learning problem. MI learning methods such as mi-SVM can be readily incorporated into our bag-based reranking framework. Observing that at least a certain portion of a positive bag is of positive instances while a negative bag might also contain positive instances, we further use a more suitable generalized MI (GMI) setting for this application. To address the ambiguities on the instance labels in the positive and negative bags under this GMI setting, we develop a new method referred to as GMI-SVM to enhance retrieval performance by propagating the labels from the bag level to the instance level. To acquire bag annotations for (G)MI learning, we propose a bag ranking method to rank all the bags according to the defined bag ranking score. The top ranked bags are used as pseudopositive training bags, while pseudonegative training bags can be obtained by randomly sampling a few irrelevant images that are not associated with the textual query. Comprehensive experiments on the challenging real-world data set NUS-WIDE demonstrate our framework with automatic bag annotation can achieve the best performances compared with existing image reranking methods. Our experiments also demonstrate that GMI-SVM can achieve better performances when using the manually labeled training bags obtained from relevance feedback.

Keywords

Internet; image retrieval; image texture; learning (artificial intelligence); search engines; support vector machines; Web image search; automatic bag annotation; bag ranking score; bag-based reranking framework; generalized multiinstance learning; mi-SVM; negative bags; positive bags; positive instances; pseudonegative training bags; pseudopositive training bags; real-world data set NUS-WIDE; relevance feedback; text-based image retrieval; text-based search; textual features; visual features; Bismuth; Image retrieval; Kernel; Learning systems; Support vector machines; Training; Visualization; Bag-based image reranking; generalized multi-instance (GMI) learning; text-based image retrieval (TBIR); Algorithms; Databases, Factual; Image Enhancement; Image Interpretation, Computer-Assisted; Internet; Pattern Recognition, Automated; Semantics;

fLanguage

English

Journal_Title

Image Processing, IEEE Transactions on

Publisher

ieee

ISSN

1057-7149

Type

jour

DOI

10.1109/TIP.2011.2159227

Filename

5872043