DocumentCode :
1066114
Title :
Predicting source code changes by mining change history
Author :
Ying, Annie T T ; Murphy, Gail C. ; Ng, Raymond ; Chu-Carroll, Mark C.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Volume :
30
Issue :
9
fYear :
2004
Firstpage :
574
Lastpage :
586
Abstract :
Software developers are often faced with modification tasks that involve source code that is spread across a code base. Some dependencies between source code, such as those between source code written in different languages, are difficult to determine using existing static and dynamic analyses. To augment existing analyses and to help developers identify relevant source code during a modification task, we have developed an approach that applies data mining techniques to determine change patterns (sets of files that were changed together frequently in the past) from the change history of the code base. Our hypothesis is that the change patterns can be used to recommend potentially relevant source code to a developer performing a modification task. We show that this approach can reveal valuable dependencies by applying the approach to the Eclipse and Mozilla open source projects and by evaluating the predictability and interestingness of the recommendations produced for actual modification tasks on these systems.
Keywords :
configuration management; data mining; program verification; software maintenance; software tools; Eclipse open source project; Mozilla open source project; association rules; change history; code base; data mining technique; modification task; pattern classification; pattern clustering; software developers; software maintainability; source code changes prediction; Association rules; Computer Society; Computer languages; Computer science; Data mining; Frequency; History; Pattern analysis; Programming profession; Software systems; Enhancement; association rules; classification; clustering; data mining; maintainability
fLanguage :
English
Journal_Title :
Software Engineering, IEEE Transactions on
Publisher :
IEEE
ISSN :
0098-5589
Type :
jour
DOI :
10.1109/TSE.2004.52
Filename :
1324645