Title :
Large-Scale Duplicate Detection for Web Image Search
Author :
Wang, Bin ; Li, Zhiwei ; Li, Mingjing ; Ma, Wei-Ying
Author_Institution :
Univ. of Sci. & Technol. of China, Hefei
Abstract :
Finding visually identical images in large image collections is important for many applications such as intelligence propriety protection and search result presentation. Several algorithms have been reported in the literature, but they are not suitable for large image collections. In this paper, a novel algorithm is proposed to handle the situation, in which each image is compactly represented by a hash code. To detect duplicate images, only the hash codes are required. In addition, a very efficient search method is implemented to quickly group images with similar hash codes for fast detection. The experiments show that our algorithm can be both efficient and effective for duplicate detection in Web image search
Keywords :
Web sites; codes; image representation; large-scale systems; search engines; Web image search; hash code; large-scale duplicate image detection; Asia; Costs; Digital cameras; Image converters; Image storage; Internet; Large-scale systems; Protection; Search engines; Uniform resource locators;
Conference_Titel :
Multimedia and Expo, 2006 IEEE International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0366-7
Electronic_ISBN :
1-4244-0367-7
DOI :
10.1109/ICME.2006.262509