Title :
Fast Rule-Line Removal Using Integral Images and Support Vector Machines
Author :
Kumar, Jayant ; Doermann, David
Author_Institution :
Inst. of Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
Abstract :
In this paper, we present a fast and effective method for removing pre-printed rule-lines in handwritten document images. We use an integral-image representation which allows fast computation of features and apply techniques for large scale Support Vector learning using a data selection strategy to sample a small subset of training data. Results on both constructed and real-world data sets show that the method is effective for rule-line removal. We compare our method to a subspace-based method and show that better accuracy can be achieved in considerably less time. The integral-image based features proposed in the paper are generic and can be applied to other problems as well.
Keywords :
document image processing; handwritten character recognition; image representation; learning (artificial intelligence); support vector machines; data selection strategy; fast rule-line removal; handwritten document images; integral-image representation; subspace-based method; support vector learning; support vector machines; Arrays; Feature extraction; Support vector machines; Text analysis; Training; Training data; Vectors; Arabic; Handwritten Documents; Rule-line;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.123