DocumentCode
2630241
Title
The processing of form documents
Author
Doermann, David S. ; Rosenfeld, Azriel
Author_Institution
Center for Automation Research, University of Maryland, College Park, MD, USA
fYear
1993
fDate
20-22 Oct 1993
Firstpage
497
Lastpage
501
Abstract
An overview of an approach to the generic modeling and processing of known forms is presented. The system provides a methodology by which models are generated from regions in the document based on their usage. Automatic extraction of an optimal set of features to be used for registration is proposed, and it is shown how specialized detectors can be designed for each feature based on its position, orientation and width properties. Registration of the form with the model is accomplished using probing to establish correspondence. Form components which are corrupted by markings are detected and isolated, the intersections are interpreted and the properties of the non-form markings are used to reconstruct the strokes through the intersections. The feasibility of these ideas is demonstrated through an implementation of key components of the system.
Keywords
business forms; document handling; feature extraction; automatic feature extraction; form documents; generic modeling; known forms; model generation; non-form markings; optimal set; specialized detectors; stroke reconstruction; width properties; Context modeling; Data mining; Detectors; Educational institutions; Finance; Graphics; Information analysis; Office automation; Optical character recognition software; Process design
fLanguage
English
Publisher
IEEE
Conference_Titel
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location
Tsukuba Science City
Print_ISBN
0-8186-4960-7
Type
conf
DOI
10.1109/ICDAR.1993.395687
Filename
395687