Title :
Content Adaptive Hash Lookups for Near-Duplicate Image Search by Full or Partial Image Queries
Author :
Harmanci, O. ; Haritaoglu, I.
Author_Institution :
Anvato, Inc., Mountain View, CA, USA
Abstract :
In this paper we present a scalable and high performance near-duplicate image search method. The proposed algorithm follows the common paradigm of computing local features around repeatable scale invariant interest points. Unlike existing methods, much shorter hashes are used (40 bits). By leveraging on the shortness of the hashes, a novel high performance search algorithm is introduced which analyzes the reliability of each bit of a hash and performs content adaptive hash lookups by adaptively adjusting the "range" of each hash bit based on reliability. Matched features are post-processed to determine the final match results. We experimentally show that the algorithm can detect cropped, resized, print-scanned and re-encoded images and pieces from images among thousands of images. The proposed algorithm can search for a 200×200 piece of image in a database of 2,250 images with size 2400×4000 in 0.020 seconds on 2.5GHz Intel Core 2.
Keywords :
file organisation; image coding; image retrieval; content adaptive hash lookups; image queries; near-duplicate image search; reencoded images; reliability; Databases; Feature extraction; Noise; Quantization; Robustness; Vectors; content adaptive hash lookup; multimedia indexing; multimedia retrieval; near-duplicate search;
Conference_Titel :
Pattern Recognition (ICPR), 2010 20th International Conference on
Conference_Location :
Istanbul
Print_ISBN :
978-1-4244-7542-1
DOI :
10.1109/ICPR.2010.391