• DocumentCode
    2631841
  • Title

    Initial learning of document structure

  • Author

    Dengel, Andreas

  • Author_Institution
    German Res. Center for Artificial Intelligence, Kaiserslautern, Germany
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    86
  • Lastpage
    90
  • Abstract
    Proposes an approach for automatically generating a decision tree which is applied as a model for the logical labeling of business letters. Instead of top-down determination of the discriminating attributes, the system inspects a finite set of document instances that are presented to a learner in a bottom-up position. The learner itself figures out local similarities, rates them with respect to the overall structure, and determines the best structural match of two instances (neighborhood). The entire decision tree is grown step by step deducing subtrees by forming generalizations from a neighborhood. Consequently, heuristics are learned for structurally discriminating documents during subsequent classification
  • Keywords
    business forms; decision theory; document handling; generalisation (artificial intelligence); heuristic programming; learning (artificial intelligence); pattern classification; trees (mathematics); best structural match; bottom-up inspection; business letters; classification; decision tree; discriminating attributes; document instances; document structure learning; generalizations; heuristics; local similarities; logical labeling; neighborhood; structural discrimination; subtrees; Artificial intelligence; Biomedical imaging; Classification tree analysis; Decision trees; Humans; Image classification; Labeling; Learning systems; Pattern matching; Weather forecasting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395776
  • Filename
    395776