Title :
Vector space analysis of software clones
Author :
Grant, Scott ; Cordy, James R.
Author_Institution :
Sch. of Comput., Queen's Univ., Kingston, ON
Abstract :
In this paper, we introduce a technique for applying independent component analysis to vector space representations of software code fragments such as methods or blocks. Each fragment is represented as a point in the vector space, and the distance between two points can be computed and used as a measure of the similarity between the original source code fragments they represent. If the initial matrix representation contains enough information about the syntactic structure of the source code, the vector space representation should be sufficient to predict the similarity of fragments to one another and to estimate the likelihood that a fragment is a clone.
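As a rough illustration of the distance step described in the abstract, the following Python sketch maps code fragments to points in a vector space and compares them by Euclidean distance. It is not the authors' implementation: the tokenizer, the token-count matrix, and the names `tokenize`, `build_vectors`, and `euclidean_distance` are assumptions made for illustration. The independent component analysis step is omitted entirely, so distances here are taken on raw count vectors.

```python
import math
import re
from collections import Counter


def tokenize(fragment):
    """Split a code fragment into identifiers, numbers, and punctuation.

    Hypothetical tokenizer chosen for illustration only.
    """
    return re.findall(r"[A-Za-z_]\w*|\d+|[^\s\w]", fragment)


def build_vectors(fragments):
    """Build one token-count vector per fragment over a shared vocabulary."""
    token_lists = [tokenize(f) for f in fragments]
    vocabulary = sorted({t for tokens in token_lists for t in tokens})
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        # Counter returns 0 for tokens that do not occur in this fragment.
        vectors.append([counts[t] for t in vocabulary])
    return vocabulary, vectors


def euclidean_distance(a, b):
    """Distance between two points; smaller means more similar fragments."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Assumed example fragments: the first two differ only in identifier names.
fragments = [
    "int sum(int a, int b) { return a + b; }",
    "int add(int x, int y) { return x + y; }",
    "void log(String msg) { System.out.println(msg); }",
]

vocabulary, vectors = build_vectors(fragments)
d_clone = euclidean_distance(vectors[0], vectors[1])
d_other = euclidean_distance(vectors[0], vectors[2])
print("near-clone pair:", d_clone)
print("unrelated pair: ", d_other)
```

In this sketch the near-clone pair ends up closer together than the unrelated pair, which is the property the abstract relies on. A representation built from syntactic structure rather than raw tokens would be less sensitive to identifier renaming.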
Keywords :
independent component analysis; software engineering; source coding; vectors; software clones; software code fragment; source code fragments; vector space analysis; vector space representation; Blind source separation; Cloning; Data mining; Decorrelation; Functional analysis; Indexing; Large scale integration; Microphones; Software maintenance
Conference_Title :
Program Comprehension, 2009. ICPC '09. IEEE 17th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-3998-0
ISSN :
1092-8138
DOI :
10.1109/ICPC.2009.5090048