DocumentCode
2631841
Title
Initial learning of document structure
Author
Dengel, Andreas
Author_Institution
German Res. Center for Artificial Intelligence, Kaiserslautern, Germany
fYear
1993
fDate
20-22 Oct 1993
Firstpage
86
Lastpage
90
Abstract
Proposes an approach for automatically generating a decision tree which is applied as a model for the logical labeling of business letters. Instead of top-down determination of the discriminating attributes, the system inspects a finite set of document instances that are presented to a learner in a bottom-up position. The learner itself figures out local similarities, rates them with respect to the overall structure, and determines the best structural match of two instances (neighborhood). The entire decision tree is grown step by step deducing subtrees by forming generalizations from a neighborhood. Consequently, heuristics are learned for structurally discriminating documents during subsequent classification
Keywords
business forms; decision theory; document handling; generalisation (artificial intelligence); heuristic programming; learning (artificial intelligence); pattern classification; trees (mathematics); best structural match; bottom-up inspection; business letters; classification; decision tree; discriminating attributes; document instances; document structure learning; generalizations; heuristics; local similarities; logical labeling; neighborhood; structural discrimination; subtrees; Artificial intelligence; Biomedical imaging; Classification tree analysis; Decision trees; Humans; Image classification; Labeling; Learning systems; Pattern matching; Weather forecasting;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location
Tsukuba Science City
Print_ISBN
0-8186-4960-7
Type
conf
DOI
10.1109/ICDAR.1993.395776
Filename
395776
Link To Document