Title :
Data mining source code to facilitate program comprehension: experiments on clustering data retrieved from C++ programs
Author :
Kanellopoulos, Yiannis ; Tjortjis, Christos
Author_Institution :
Dept. of Comput., UMIST, Manchester, UK
Abstract :
This paper presents ongoing work on using data mining to discover knowledge about software systems, thus facilitating program comprehension. We discuss how this work fits in the context of tool-supported maintenance and comprehension, and report on applying a new methodology to C++ programs. The overall framework can provide practical insights and guide the maintainer through the specifics of a system, assuming little prior familiarity with it. The contribution of this work is twofold: it provides a model and an associated method for extracting data from C++ source code for subsequent mining, and it evaluates a proposed framework for clustering such data to obtain useful knowledge. The methodology is evaluated on three open source applications; the results are assessed and conclusions are presented. The paper concludes with directions for future work.
Keywords :
C++ language; data mining; public domain software; reverse engineering; software maintenance; C++ programs; C++ source code; data clustering; data extraction; data mining; open source applications; program comprehension; tool supported maintenance; Application software; Character recognition; Computer languages; Data mining; Data models; Documentation; Information retrieval; Object oriented modeling; Software maintenance; Software systems;
Conference_Title :
Program Comprehension, 2004. Proceedings. 12th IEEE International Workshop on
Print_ISBN :
0-7695-2149-5
DOI :
10.1109/WPC.2004.1311063