Title :
Generating robust parsers using island grammars
Author_Institution :
CWI, Amsterdam, Netherlands
fDate :
6/23/1905 12:00:00 AM
Abstract :
Source model extraction, the automated extraction of information from system artifacts, is a common phase in reverse engineering tools. One of the major challenges of this phase is creating extractors that can deal with irregularities in the artifacts that are typical for the reverse engineering domain (for example, syntactic errors, incomplete source code, language dialects and embedded languages). The paper proposes a solution in the form of island grammars, a special kind of grammar that combines the detailed specification possibilities of grammars with the liberal behavior of lexical approaches. We show how island grammars can be used to generate robust parsers that combine the accuracy of syntactical analysis with the speed, flexibility and tolerance usually only found in lexical analysis. We conclude with a discussion of the development of MANGROVE, a generator for source model extractors based on island grammars and describe its application to a number of case studies
Keywords :
computational linguistics; grammars; program compilers; reverse engineering; MANGROVE; automated information extraction; case studies; detailed specification; embedded languages; fuzzy parsing; incomplete source code; island grammars; language dialects; lexical approaches; parser generation; partial parsing; program analysis; reverse engineering domain; reverse engineering tools; robust parser generation; robust parsers; source model extraction; source model extractors; syntactic errors; syntactical analysis; system artifacts; Application software; Computer languages; Data mining; Libraries; Maintenance engineering; Mars; Reverse engineering; Robustness; Software maintenance; Transaction databases;
Conference_Titel :
Reverse Engineering, 2001. Proceedings. Eighth Working Conference on
Conference_Location :
Stuttgart
Print_ISBN :
0-7695-1303-4
DOI :
10.1109/WCRE.2001.957806