• DocumentCode
    2630241
  • Title

    The processing of form documents

  • Author

    Doermann, David S. ; Rosenfeld, Azriel

  • Author_Institution
    Center for Autom. Res., Maryland Univ., College Park, MD, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    497
  • Lastpage
    501
  • Abstract
    An overview of an approach to the generic modeling and processing of known forms is presented. The system provides a methodology by which models are generated from regions in the document based on their usage. Automatic extraction of an optimal set of features to be used for registration is proposed, and it is shown how specialized detectors can be designed for each feature based on its position, orientation and width properties. Registration of the form with the model is accomplished using probing to establish correspondence. Form components that are corrupted by markings are detected and isolated, the intersections are interpreted, and the properties of the non-form markings are used to reconstruct the strokes through the intersections. The feasibility of these ideas is demonstrated through an implementation of key components of the system.
  • Keywords
    business forms; document handling; feature extraction; automatic feature extraction; form documents; generic modeling; known forms; model generation; non-form markings; optimal set; specialized detectors; stroke reconstruction; width properties; Context modeling; Data mining; Detectors; Educational institutions; Finance; Graphics; Information analysis; Office automation; Optical character recognition software; Process design
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395687
  • Filename
    395687