DocumentCode
1490056
Title
Investigating the use of chronological split for software effort estimation
Author
Lokan, Chris ; Mendes, Eduardo
Author_Institution
Sch. of Eng. & Inf. Technol., Univ. of New South Wales at ADFA, Canberra, ACT, Australia
Volume
3
Issue
5
fYear
2009
fDate
10/1/2009 12:00:00 AM
Firstpage
422
Lastpage
434
Abstract
In previous studies, the authors investigated separately the use of two different types of chronological splits (project-by-project split and date-based split) for assigning projects to training sets and testing sets. The aim of this study is to compare the two types of chronological splits against each other, to see whether either leads to better prediction accuracy. Estimation models were built and evaluated using training and testing sets formed using project-by-project chronological splitting and date-based splitting using two different splitting dates. The authors used 906 projects from the ISBSG Release 10 repository. The authors found no significant differences between the accuracy of models built and evaluated with either of the different splitting methods. However, models built using later splitting dates were more accurate than models built using earlier splitting dates. Different accuracy with different splitting dates suggests that chronological splitting is useful. Therefore the authors recommend that training and testing sets should be formed with regard to chronology, and a date-based split appears sensible for researchers in this field.
Keywords
software development management; ISBSG Release 10 repository; chronological split; date-based splitting; project-by-project chronological splitting; software effort estimation; testing sets; training sets;
fLanguage
English
Journal_Title
Software, IET
Publisher
iet
ISSN
1751-8806
Type
jour
DOI
10.1049/iet-sen.2008.0107
Filename
5273794
Link To Document