Title :
De-duplication of photograph images using histogram refinement
Author :
Ramaiah, N. Pattabhi ; Mohan, C. Krishna
Author_Institution :
Dept. of Comput. Sci. & Eng., Indian Inst. of Technol. Hyderabad, Hyderabad, India
Abstract :
Content-based image retrieval (CBIR), a technique that uses image content such as color, texture, and shape to search for images in large-scale databases, is an active research area. In this paper, photograph de-duplication is implemented using CBIR with a color histogram refinement feature. The photograph data is divided into clusters using the k-means clustering algorithm, with the number of clusters depending on the number of photographs in each district of the state. The de-duplication exercise was carried out on a large database containing approximately 22 million photograph images. The experimental results show approximately 0.35 million duplicate photographs.
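
The abstract does not describe the feature computation in detail, so the following is only a minimal sketch of the general idea behind histogram refinement, not the authors' implementation. It assumes a grayscale image given as a list of rows, 4-connected regions, and a hypothetical region-size threshold `min_region`. The function names `coherence_vector` and `ccv_distance` are illustrative. Each intensity bucket of the histogram is split into "coherent" pixels (part of a large same-bucket region) and "incoherent" pixels (isolated or in small regions), and two images are compared by the L1 distance between these refined histograms.

```python
from collections import deque


def coherence_vector(image, levels=4, min_region=4):
    """Return {bucket: [coherent_count, incoherent_count]}.

    image: list of equal-length rows of ints in 0..255 (grayscale, an
    assumption made here to keep the sketch short).
    levels and min_region are illustrative parameters, not values from the paper.
    """
    height, width = len(image), len(image[0])
    step = 256 // levels
    # Quantize every pixel into one of `levels` histogram buckets.
    buckets = [[min(p // step, levels - 1) for p in row] for row in image]
    seen = [[False] * width for _ in range(height)]
    ccv = {b: [0, 0] for b in range(levels)}

    for y in range(height):
        for x in range(width):
            if seen[y][x]:
                continue
            bucket = buckets[y][x]
            # Breadth-first flood fill over 4-connected pixels in the same bucket.
            queue = deque([(y, x)])
            seen[y][x] = True
            size = 0
            while queue:
                cy, cx = queue.popleft()
                size += 1
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and buckets[ny][nx] == bucket):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            # Large regions count as coherent, small ones as incoherent.
            ccv[bucket][0 if size >= min_region else 1] += size
    return ccv


def ccv_distance(a, b):
    """L1 distance between two refined histograms with the same buckets."""
    return sum(abs(a[k][0] - b[k][0]) + abs(a[k][1] - b[k][1]) for k in a)


if __name__ == "__main__":
    img = [
        [0, 0, 0, 200],
        [0, 0, 0, 0],
        [200, 0, 0, 0],
    ]
    blank = [[0] * 4 for _ in range(3)]
    print(coherence_vector(img))
    print(ccv_distance(coherence_vector(img), coherence_vector(blank)))
```

In a de-duplication setting, a small distance between two refined histograms would flag a candidate duplicate pair. Restricting comparisons to photographs within the same k-means cluster, as the abstract describes, keeps the number of pairwise comparisons manageable for a database of this size.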
Keywords :
content-based retrieval; image colour analysis; image retrieval; pattern clustering; CBIR technique; color histogram refinement feature; content based image retrieval; k-means clustering algorithm; photograph database; photograph image deduplication; Clustering algorithms; Databases; Feature extraction; Histograms; Image color analysis; Vectors; Wavelet transforms; Daubechies-4 wavelet transform; color; de-duplication; histogram refinement; k-means clustering; texture;
Conference_Title :
Recent Advances in Intelligent Computational Systems (RAICS), 2011 IEEE
Conference_Location :
Trivandrum
Print_ISBN :
978-1-4244-9478-1
DOI :
10.1109/RAICS.2011.6069341