• DocumentCode
    3281081
  • Title

    Identifying Software Decompositions by Applying Transaction Clustering on Source Code

  • Author

    Sindhgatta, Renuka ; Pooloth, Krishnakumar

  • Author_Institution
    Infosys Technol., Bangalore
  • Volume
    1
  • fYear
    2007
  • fDate
    24-27 July 2007
  • Firstpage
    317
  • Lastpage
    326
  • Abstract
    Majority of the software clustering algorithms use structural dependencies to decompose large software systems. While these techniques have merit, they do not always match the decompositions generated by experts who often group software entities based on their purpose. This paper presents an approach to identifying decompositions of a software system, based on the joint participation of software entities in realizing the functionality of the system. Software transactions representing units of functionality are extracted from the source code. Transactions are clustered based on the commonality of software entities used in the transactions. Our approach also assesses the use of check-in data from configuration management system for software clustering where software entities that are modified or updated together form a software transaction. We introduce CoST, a clustering tool that uses Transaction Clustering to identify software decompositions. We apply CoST to three large software systems. The results indicate that this approach produces groupings that come close to decompositions prepared by experts.
  • Keywords
    configuration management; software engineering; transaction processing; CoST; configuration management system; software decomposition; software transaction clustering; source code; Application software; Assembly; Clustering algorithms; Costs; Data mining; Java; Software algorithms; Software engineering; Software systems; Software tools;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference, 2007. COMPSAC 2007. 31st Annual International
  • Conference_Location
    Beijing
  • ISSN
    0730-3157
  • Print_ISBN
    0-7695-2870-8
  • Type

    conf

  • DOI
    10.1109/COMPSAC.2007.126
  • Filename
    4291020