• DocumentCode
    1490056
  • Title

    Investigating the use of chronological split for software effort estimation

  • Author

    Lokan, Chris ; Mendes, Eduardo

  • Author_Institution
    Sch. of Eng. & Inf. Technol., Univ. of New South Wales at ADFA, Canberra, ACT, Australia
  • Volume
    3
  • Issue
    5
  • fYear
    2009
  • fDate
    10/1/2009 12:00:00 AM
  • Firstpage
    422
  • Lastpage
    434
  • Abstract
    In previous studies, the authors investigated separately the use of two different types of chronological splits (project-by-project split and date-based split) for assigning projects to training sets and testing sets. The aim of this study is to compare the two types of chronological splits against each other, to see whether either leads to better prediction accuracy. Estimation models were built and evaluated using training and testing sets formed using project-by-project chronological splitting and date-based splitting using two different splitting dates. The authors used 906 projects from the ISBSG Release 10 repository. The authors found no significant differences between the accuracy of models built and evaluated with either of the different splitting methods. However, models built using later splitting dates were more accurate than models built using earlier splitting dates. Different accuracy with different splitting dates suggests that chronological splitting is useful. Therefore the authors recommend that training and testing sets should be formed with regard to chronology, and a date-based split appears sensible for researchers in this field.
  • Keywords
    software development management; ISBSG Release 10 repository; chronological split; date-based splitting; project-by-project chronological splitting; software effort estimation; testing sets; training sets;
  • fLanguage
    English
  • Journal_Title
    Software, IET
  • Publisher
    iet
  • ISSN
    1751-8806
  • Type

    jour

  • DOI
    10.1049/iet-sen.2008.0107
  • Filename
    5273794