DocumentCode :
2059063
Title :
Matching form lines based on a heuristic search
Author :
Bohnacker, Uli ; Schacht, Johannes ; Yücel, Tulug
Author_Institution :
Res. Center, Daimler-Benz AG, Ulm, Germany
Volume :
1
fYear :
1997
fDate :
18-20 Aug 1997
Firstpage :
86
Abstract :
A major problem in form reading applications is that form fields cannot be located exactly because of nonlinear distortions on the form images. Such nonlinear distortions appear for example on photocopied forms or on forms transmitted by fax. One way to solve this problem is to determine the form fields by considering the positions of the form lines. This paper describes a new method to find pairs of corresponding form lines on a reference form and a filled form. The advantage of this method is that the corresponding line pairs can be used to map any pixel of the filled form and the reference form without any assumption about the kind of distortion. The core of this method is an algorithm that is based on the A*-search algorithm. Two sets of horizontal or vertical lines, one from the reference form and one from the filled form, are searched for pairs of corresponding lines. The algorithm´s run time is low and nonlinear distortions of the form images hardly influence its results. With increasing complexity-i.e. increasing number of lines or decreasing image quality-the number of rejected form lines grows, but the error rate stays low
Keywords :
business forms; document image processing; edge detection; heuristic programming; image matching; search problems; A*-search algorithm; complexity; corresponding line pairs; error rate; facsimile transmission; filled form; form field location; form identification; form image nonlinear distortions; form line matching; form reading applications; form recognition; form structures; heuristic search; horizontal lines; image quality; photocopied forms; pixel mapping; reference form; rejected form lines; run time; vertical lines; Algorithm design and analysis; Data mining; Humans; Image quality; Nonlinear distortion; Printing; Runtime; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
Type :
conf
DOI :
10.1109/ICDAR.1997.619819
Filename :
619819
Link To Document :
بازگشت