DocumentCode
1928712
Title
Linear complexity object-oriented similarity for clone detection and software evolution analyses
Author
Merlo, E. ; Antoniol, G. ; Di Penta, M. ; Rollo, V.F.
Author_Institution
Ecole Polytechnique de Montreal, Que., Canada
fYear
2004
fDate
11-14 Sept. 2004
Firstpage
412
Lastpage
416
Abstract
With the widespread adoption of object-oriented technologies, the lack of computationally efficient and scalable approaches is limiting the ability to model and analyze the history of large object-oriented software systems. This paper proposes an approximate representation of object-oriented code characteristics, inspired by pattern recognition centroids for clustering. An interesting application of such a representation is a linear-time complexity algorithm to detect duplicate or nearly duplicated code in object-oriented systems. The algorithm accuracy and time complexity were assessed on 11 releases of a large software system, the Eclipse framework.
Keywords
computational complexity; data flow analysis; object-oriented programming; pattern recognition; software maintenance; software prototyping; Eclipse framework; clone detection; large object-oriented software system; linear complexity object-oriented similarity; linear-time complexity algorithm; object-oriented code characteristics; object-oriented software evolution; pattern recognition centroids; software evolution analysis; source code analysis; time complexity; Application software; Cloning; Clustering algorithms; History; Object oriented modeling; Open source software; Pattern recognition; Software algorithms; Software maintenance; Software systems
fLanguage
English
Publisher
IEEE
Conference_Titel
Software Maintenance, 2004. Proceedings. 20th IEEE International Conference on
ISSN
1063-6773
Print_ISBN
0-7695-2213-0
Type
conf
DOI
10.1109/ICSM.2004.1357826
Filename
1357826