DocumentCode :
1575946
Title :
Mining Software Evolution to Predict Refactoring
Author :
Ratzinger, Jacek ; Sigmund, Thomas ; Vorburger, Peter ; Gall, Harald
Author_Institution :
Vienna Univ. of Technol., Vienna
fYear :
2007
Firstpage :
354
Lastpage :
363
Abstract :
Can we predict locations of future refactoring based on the development history? In an empirical study of open source projects we found that attributes of software evolution data can be used to predict the need for refactoring in the following two months of development. Information systems utilized in software projects provide a broad range of data for decision support. Versioning systems log each activity during the development, which we use to extract data mining features such as growth measures, relationships between classes, the number of authors working on a particular piece of code, etc. We use this information as input into classification algorithms to create prediction models for future refactoring activities. Different state-of-the-art classifiers are investigated such as decision trees, logistic model trees, prepositional rule learners, and nearest neighbor algorithms. With both high precision and high recall we can assess the refactoring proneness of object-oriented systems. Although we investigate different domains, we discovered critical factors within the development life cycle leading to refactoring, which are common among all studied projects.
Keywords :
data mining; decision trees; object-oriented methods; software engineering; data mining features; decision support; decision trees; development history; logistic model trees; mining software evolution; object-oriented systems; open source projects; prepositional rule learners; software projects; versioning systems; Classification algorithms; Data mining; Feature extraction; History; Information systems; Object oriented modeling; Open source software; Particle measurements; Prediction algorithms; Software systems;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Empirical Software Engineering and Measurement, 2007. ESEM 2007. First International Symposium on
Conference_Location :
Madrid
ISSN :
1938-6451
Print_ISBN :
978-0-7695-2886-1
Type :
conf
DOI :
10.1109/ESEM.2007.9
Filename :
4343763
Link To Document :
بازگشت