Title :
A modular approach for query spotting in document images and its optimization using genetic algorithms
Author :
Chatbri, Houssem ; Kwan, Paul ; Kameyama, Keisuke
Author_Institution :
Dept. of Comput. Sci., Univ. of Tsukuba, Tsukuba, Japan
Abstract :
Query spotting in document images is a subclass of Content-Based Image Retrieval (CBIR) algorithms concerned with detecting occurrences of a query in a document image. Due to noise and complexity of document images, spotting can be a challenging task and easily prone to false positives and partially incorrect matches, thereby reducing the overall precision of the algorithm. A robust and accurate spotting algorithm is essential to our current research on sketch-based retrieval of digitized lecture materials. We have recently proposed a modular spotting algorithm in [1]. Compared to existing methods, our algorithm is both application-independent and segmentation-free. However, it faces the same challenges of noise and complexity of images. In this paper, inspired by our earlier research on optimizing parameter settings for CBIR using an evolutionary algorithm [2][3], we introduce a Genetic Algorithm-based optimization step in our spotting algorithm to improve each spotting result. Experiments using an image dataset of journal pages reveal promising performance, in that the precision is significantly improved but without compromising the recall of the overall spotting result.
Keywords :
content-based retrieval; document image processing; genetic algorithms; image retrieval; CBIR algorithm; content-based image retrieval; digitized lecture materials; document images; genetic algorithm; image complexity; image noise; modular approach; query spotting; sketch-based retrieval; spotting algorithm; Biological cells; Feature extraction; Genetic algorithms; Image segmentation; Optimization; Shape; Vectors;
Conference_Titel :
Evolutionary Computation (CEC), 2014 IEEE Congress on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6626-4
DOI :
10.1109/CEC.2014.6900475