Title :
N-grams: a well-structured knowledge representation for recognition of graphical documents
Author :
Lank, Edward ; Blostein, Dorothea
Author_Institution :
Dept. of Comput. & Inf. Sci., Queen´´s Univ., Kingston, Ont., Canada
Abstract :
N-grams are a well-structured knowledge representation that has proven useful in the recognition of textual documents. In this paper, we propose that n-grams can be extended to the domain of graphical documents as well. This has the advantage of providing a regular, easily-maintainable knowledge representation for local constraints, specifically symbol-interaction knowledge, in the graphical domain. To extend the definition of an n-gram from the textual to the graphical domain, we must resolve how to handle directional considerations, how to treat the variety of relations that can occur between image primitives, and how to treat multiple neighbours. We have implemented a prototype system for the application of n-gram knowledge to the recognition of sketch maps. Early results are encouraging. We look forward to further refining this system. N-grams appear to be a promising tool for representing spatial constraints in a form that is useful to the recognition of graphical documents
Keywords :
constraint handling; constraint theory; document image processing; image recognition; knowledge representation; nomograms; spatial data structures; directional considerations; graphical document recognition; image primitives; knowledge representation; local constraints; multiple neighbours; n-grams; prototype system; sketch map recognition; spatial constraints; symbol-interaction knowledge; Character recognition; Data structures; Dictionaries; Engineering drawings; Image recognition; Image resolution; Information science; Knowledge representation; Prototypes; Statistics;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.620621