Title :
Efficient analysis of complex diagrams using constraint-based parsing
Author :
Futrelle, Robert P. ; Nikolakis, Nikos
Author_Institution :
Coll. of Comput. Sci., Northeastern Univ., Boston, MA, USA
Abstract :
This paper describes substantial advances in the analysis (parsing) of diagrams using constraint grammars. The addition of set types to the grammar and spatial indexing of the data make it possible to efficiently parse real diagrams of substantial complexity. The system is probably the first to demonstrate efficient diagram parsing using grammars that easily be retargeted to other domains. The work assumes that the diagrams are available as a flat collection of graphics primitives: lines, polygons, circles, Bezier curves and text. This is appropriate for future electronic documents or for vectorized diagrams converted from scanned images. The classes of diagrams that we have analyzed include x,y data graphs and genetic diagrams drawn from the biological literature, as well as finite state automata diagrams (states and arcs). As an example, parsing a four-part data graph composed of 133 primitives required 35 sec using Macintosh Common Lisp on a Macintosh Quadra 700
Keywords :
computational complexity; computational geometry; computer graphics; document image processing; finite automata; grammars; indexing; Bezier curves; Macintosh Common Lisp; Macintosh Quadra 700; circles; complex diagrams; constraint grammars; constraint-based parsing; electronic documents; finite state automata diagrams; genetic diagrams; graphics primitives; lines; polygons; spatial indexing; vectorized diagrams; Automata; Biology; Computer science; Data analysis; Educational institutions; Genetics; Graphics; Image converters; Indexing; Laboratories;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.602019