Author_Institution :
Electr. Eng. & Inf. Sci. Dept., Univ. of Sci. & Technol. of China, Hefei, China
Abstract :
With the explosive growth of the multimedia data on the Web, content-based image search has attracted considerable attentions in the multimedia and the computer vision community. The most popular approach is based on the bag-of-visual-words model with invariant local features. Since the spatial context information among local features is critical for visual content identification, many methods exploit the geometric clues of local features, including the location, the scale, and the orientation, for explicitly post-geometric verification. However, usually only a few initially top-ranked results are geometrically verified, considering the high computational cost in full geometric verification. In this paper, we propose to represent the spatial context of local features into binary codes, and implicitly achieve geometric verification by efficient comparison of the binary codes. Besides, we explore the multimode property of local features to further boost the retrieval performance. Experiments on holidays, Paris, and Oxford building benchmark data sets demonstrate the effectiveness of the proposed algorithm.
Keywords :
Internet; binary codes; computational geometry; computer vision; content-based retrieval; feature extraction; file organisation; image coding; image retrieval; multimedia computing; Oxford; Paris; Web; bag-of-visual-words model; benchmark data sets; binary codes; computer vision community; content-based image search; contextual hashing; geometric clues; high computational cost; invariant local features; large-scale image search; location; multimedia data; orientation; post-geometric verification; scale; spatial context information; visual content identification; Binary codes; Context; Feature extraction; Hamming distance; Indexing; Vectors; Visualization; BoVW; Image search; geometric verification; hashing; spatial context modeling;