Title :
An Extended Line-Based Approach to Detect Code Clones Using Syntactic and Lexical Information
Author_Institution :
Dept. of Bus. Adm. & Inf. Sci., Chubu Univ., Kasugai, Japan
Abstract :
This paper proposes a new line-based approach for the detection of code clones using syntactic and lexical information. A customized compiler writes a source code representation that contains syntactic and lexical information. A new clone detection tool called LePalex reads the source code representation, and converts it to three types of code: first normal form, second normal form, and third normal form. The first normal form is used to detect the exact match of code clones. The second normal form is used to detect the syntactic match of code clones. The third normal form is used to check for syntactically correct segments of code clones. This paper demonstrates the advantage of this approach in achieving programming language independence using syntactic and lexical information.
Keywords :
application program interfaces; program compilers; LePalex; application program interfaces; code clone detection; customized compiler; extended line-based approach; lexical information; programming language independence; source code representation; syntactic information; Cloning; Computer bugs; Computer languages; Graphical user interfaces; Information science; Information technology; Java; Libraries; Operating systems; Testing; Code Clone; Source Code Representation; String Matching;
Conference_Titel :
Information Technology: New Generations (ITNG), 2010 Seventh International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-6270-4
DOI :
10.1109/ITNG.2010.176