DocumentCode
2041783
Title
A statistical form reading system
Author
Li Xingyuan ; Hong Jiarong ; Zhang Zhaohui ; Chen Bin
Author_Institution
Dept. of Comput. Sci., Harbin Inst. of Technol., China
Volume
2
fYear
1993
fDate
19-21 Oct. 1993
Firstpage
1062
Abstract
In this paper, we introduce a system able to read statistical forms. The system is composed of a personal computer, a scanner and a form reading software. For a form image, the system first extracts the form lines, then locates the individual rectangle fields of the form, obtains the relation between them and sends image segment of each field to character recognition module. In character recognition, we use the AQ15 machine learning system to generate the classifier. The result can be stored in text or in database, with or without form lines. When one kind of form is first input (we refer to this form as unlearned), the attribute of each field column is determined by man-machine interaction, then the structure and the public fields of the form is recorded. When a learned form is input, the structure of the form is compared with the original structure. If no conflict, the system recognizes non-public fields consisting of numerals or printed Chinese characters, while the other fields can be input friendly.<>
Keywords
feature extraction; image scanners; learning systems; mathematical morphology; optical character recognition; AQ15 machine learning system; character recognition module; classifier; form line extraction; form reading software; image segment; man-machine interaction; personal computer; scanner; statistical form reading system; Character recognition; Computer science; Data mining; Image databases; Image segmentation; Learning systems; Man machine systems; Microcomputers; Morphology; Optical character recognition software;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering.1993 IEEE Region 10 Conference on
Conference_Location
Beijing, China
Print_ISBN
0-7803-1233-3
Type
conf
DOI
10.1109/TENCON.1993.320198
Filename
320198
Link To Document