Title :
Retrieval of efficiently classified, re-ranked images using histogram based score computation algorithm extended with the elimination of duplicate images
Author :
Shinde, Parag N. ; Manjrekar, A.A.
Author_Institution :
Dept. of Technol., Comput. Sci. & Technol., Kolhapur, India
Abstract :
Internet Image search is a day to day activity performed by user. User enters a keyword in search engines like Google, Yahoo, Bing etc for retrieval of keyword related images, where millions of images are retrieved through search engines. The problem with a keyword search is that keywords entered by user are very short and ambiguous, hence images which are retrieved are of different categories and some of them are irrelevant. Visual information is used in order to solve the ambiguity in text based image retrieval. User only has to click on one query image. The query image is categorized based on textual features like image title, image URL, context, where a metadata corresponding to every image is extracted and also some visual features like histogram distance computation, SIFT, region based features are extracted. The query image selected by the user is first classified into a particular category and the images related to the query image are then retrieved by matching the class of query image and the class of other images. Using image clustering, classified images are clustered to group highly relevant images into one cluster and the keywords corresponding to the image clusters are extracted. The original keyword is extended by appending the extracted keyword with highest frequency. This gives more detail idea about user´s search intention. The images are then re-ranked using visual and textual similarity metrics. Duplicate images which are retrieved in search results are detected and eliminated by using SURF(Speeded Up Robust Feature) technique. The system is tested on variety of categories like person, scenery images at semantic level and other general categories like general objects, objects with simple background etc. The system is totally web based and works dynamically on any keyword given as a input by user.
Keywords :
Internet; feature extraction; image classification; image matching; image retrieval; pattern clustering; search engines; Internet image search; SURF technique; classified image retrieval; duplicate image elimination; histogram based score computation algorithm; image clustering; keyword related image retrieval; metadata; query image matching; reranked image retrieval; search engines; speeded up robust feature technique; text based image retrieval; textual features; textual similarity metrics; visual features; visual information; visual similarity metrics; Convergence; Feature extraction; Histograms; Image retrieval; Indexes; Visualization; Histogram; Image Retrieval; Nonduplicate; Re-Ranked; Scores; Visual Information;
Conference_Titel :
Convergence of Technology (I2CT), 2014 International Conference for
Print_ISBN :
978-1-4799-3758-5
DOI :
10.1109/I2CT.2014.7092036