• DocumentCode
    3132918
  • Title
    Data mining source code to facilitate program comprehension: experiments on clustering data retrieved from C++ programs
  • Author
    Kanellopoulos, Yiannis ; Tjortjis, Christos
  • Author_Institution
    Dept. of Comput., UMIST, Manchester, UK
  • fYear
    2004
  • fDate
    24-26 June 2004
  • Firstpage
    214
  • Lastpage
    223
  • Abstract
    This paper presents ongoing work on using data mining to discover knowledge about software systems, thus facilitating program comprehension. We discuss how this work fits in the context of tool-supported maintenance and comprehension, and report on applying a new methodology to C++ programs. The overall framework can provide practical insights and guide the maintainer through the specifics of systems, assuming little familiarity with them. The contribution of this work is twofold: it provides a model and an associated method for extracting data from C++ source code, which is subsequently mined, and it evaluates a proposed framework for clustering such data to obtain useful knowledge. The methodology is evaluated on three open source applications; results are assessed and conclusions are presented. The paper concludes with directions for future work.
  • Keywords
    C++ language; data mining; public domain software; reverse engineering; software maintenance; C++ programs; C++ source code; data clustering; data extraction; data mining; open source applications; program comprehension; tool supported maintenance; Application software; Character recognition; Computer languages; Data mining; Data models; Documentation; Information retrieval; Object oriented modeling; Software maintenance; Software systems;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Program Comprehension, 2004. Proceedings. 12th IEEE International Workshop on
  • ISSN
    1092-8138
  • Print_ISBN
    0-7695-2149-5
  • Type
    conf
  • DOI
    10.1109/WPC.2004.1311063
  • Filename
    1311063