• DocumentCode
    2041783
  • Title

    A statistical form reading system

  • Author

    Li Xingyuan ; Hong Jiarong ; Zhang Zhaohui ; Chen Bin

  • Author_Institution
    Dept. of Comput. Sci., Harbin Inst. of Technol., China
  • Volume
    2
  • fYear
    1993
  • fDate
    19-21 Oct. 1993
  • Firstpage
    1062
  • Abstract
    In this paper, we introduce a system able to read statistical forms. The system is composed of a personal computer, a scanner and a form reading software. For a form image, the system first extracts the form lines, then locates the individual rectangle fields of the form, obtains the relation between them and sends image segment of each field to character recognition module. In character recognition, we use the AQ15 machine learning system to generate the classifier. The result can be stored in text or in database, with or without form lines. When one kind of form is first input (we refer to this form as unlearned), the attribute of each field column is determined by man-machine interaction, then the structure and the public fields of the form is recorded. When a learned form is input, the structure of the form is compared with the original structure. If no conflict, the system recognizes non-public fields consisting of numerals or printed Chinese characters, while the other fields can be input friendly.<>
  • Keywords
    feature extraction; image scanners; learning systems; mathematical morphology; optical character recognition; AQ15 machine learning system; character recognition module; classifier; form line extraction; form reading software; image segment; man-machine interaction; personal computer; scanner; statistical form reading system; Character recognition; Computer science; Data mining; Image databases; Image segmentation; Learning systems; Man machine systems; Microcomputers; Morphology; Optical character recognition software;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON '93. Proceedings. Computer, Communication, Control and Power Engineering.1993 IEEE Region 10 Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-1233-3
  • Type

    conf

  • DOI
    10.1109/TENCON.1993.320198
  • Filename
    320198