DocumentCode
2763468
Title
Field-data grouping for form document processing using a gravitation-based algorithm
Author
Chen, Jiun-Lin ; Lee, Hsi-Jian
Author_Institution
Dept. of Comput. Sci. & Inf. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume
2
fYear
1998
fDate
16-20 Aug 1998
Firstpage
1095
Abstract
This paper presents a novel approach to grouping Chinese handwritten field-data of form documents using a gravitation-based algorithm. We develop an algorithm to extract handwritten field data which may be written out of the fields. We first extract and remove form lines for input form images. Next, we detect connected-components from remaining data, and compute the gravitation for each connected-component by using the black pixel counts as their mass. Then, we move connected-components to their field center according to their gravitation, since filled-in data have the locality property, that is, data of the same field are usually written in a local area consecutively. After moving these connected-components for a certain times, we can assign most components to the fields where they should be. Thus, we can determine which connected-components should be extracted for a particular field. Experimental results show that this proposed method can group field-data effectively
Keywords
document image processing; feature extraction; handwritten character recognition; connected-component; document image processing; feature extraction; field-data grouping; form document processing; gravitation-based algorithm; handwritten Chinese characters; Data mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
Conference_Location
Brisbane, Qld.
ISSN
1051-4651
Print_ISBN
0-8186-8512-3
Type
conf
DOI
10.1109/ICPR.1998.711884
Filename
711884
Link To Document