Title :
Line separation for complex document images using fuzzy runlength
Author :
Shi, Zhixin ; Govindaraju, Venu
Author_Institution :
Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
Abstract :
A new text line location and separation algorithm for complex handwritten documents is proposed. The algorithm is based on the application of a fuzzy directional runlength. The proposed technique was tested on a variety of complex handwritten document images including postal parcel images and historical handwritten documents such as Newton´s and Galileo´s manuscripts. A preliminary testing showed a successful rate of 93% of the test set.
Keywords :
document image processing; handwritten character recognition; history; pattern classification; runlength codes; text analysis; Galileo manuscripts; Newton manuscripts; complex handwritten document images; fuzzy runlength; historical handwritten documents; postal parcel images; text line separation algorithm; Character recognition; Computational efficiency; Data mining; Graphics; Histograms; Nearest neighbor searches; Optical character recognition software; Testing; Text analysis; Venus;
Conference_Titel :
Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
Print_ISBN :
0-7695-2088-X
DOI :
10.1109/DIAL.2004.1263259