• DocumentCode
    2630398
  • Title

    Understanding structured text documents by a model based document analysis system

  • Author

    Bayer, T.A.

  • Author_Institution
    Daimler-Benz Res., Ulm, Germany
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    448
  • Lastpage
    453
  • Abstract
    A document analysis system which is capable of extracting the semantics of specific text portions of structured documents is presented. The architecture of the analysis system is based on a knowledge representation scheme, a semantic network, called Resco-Frame Representation of Structured Documents. It allows the definition of knowledge about document components as well as knowledge about analysis algorithms in a uniform, simple, but powerful representation formalism. Hence, this architecture enables the analysis system to exploit the specific power of both the algorithmic knowledge describing the properties of algorithms and the declarative knowledge about properties of text objects in documents. The inference engine and the control algorithm show how these two knowledge sources are combined and utilized. The flexibility of the representation formalism Fresco, the recognition results and the computational complexity of the inference algorithm are presented in two different applications
  • Keywords
    document handling; model-based reasoning; semantic networks; control algorithm; declarative knowledge; inference engine; knowledge representation scheme; model based document analysis system; representation formalism; semantic network; structured text documents; text objects; Algorithm design and analysis; Computational complexity; Data mining; Document image processing; Electronics packaging; Engines; Image recognition; Inference algorithms; Optical character recognition software; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395697
  • Filename
    395697