DocumentCode
402499
Title
Line separation for complex document images using fuzzy runlength
Author
Shi, Zhixin ; Govindaraju, Venu
Author_Institution
Center of Excellence for Document Anal. & Recognition, State Univ. of New York, Buffalo, NY, USA
fYear
2004
fDate
2004
Firstpage
306
Lastpage
312
Abstract
A new text line location and separation algorithm for complex handwritten documents is proposed. The algorithm is based on the application of a fuzzy directional runlength. The proposed technique was tested on a variety of complex handwritten document images, including postal parcel images and historical handwritten documents such as Newton's and Galileo's manuscripts. Preliminary testing showed a success rate of 93% on the test set.
Keywords
document image processing; handwritten character recognition; history; pattern classification; runlength codes; text analysis; Galileo manuscripts; Newton manuscripts; complex handwritten document images; fuzzy runlength; historical handwritten documents; postal parcel images; text line separation algorithm; Character recognition; Computational efficiency; Data mining; Graphics; Histograms; Nearest neighbor searches; Optical character recognition software; Testing; Text analysis; Venus;
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
Print_ISBN
0-7695-2088-X
Type
conf
DOI
10.1109/DIAL.2004.1263259
Filename
1263259