• DocumentCode
    1863308
  • Title

    Filtering of Inconsistent Software Project Data for Analogy-Based Effort Estimation

  • Author

    Le-Do, Tuan Khanh ; Yoon, Kyung-A ; Seo, Yeong-Seok ; Bae, Doo-Hwan

  • Author_Institution
    Dept. of Comput. Sci., Korea Adv. Inst. of Sci. & Technol. (KAIST), Daejeon, South Korea
  • fYear
    2010
  • fDate
    19-23 July 2010
  • Firstpage
    503
  • Lastpage
    508
  • Abstract
    Accurate software effort estimation is essential for successful project management. To improve the accuracy, a number of estimation techniques have been developed. Among those, Analogy-Based Estimation (ABE) has become one of the mainstreams of effort estimation. In general, ABE infers the effort to accomplish a new project from the efforts of the historical projects which possess similar characteristics. ABE is simple, yet it can be affected by the noise in historical projects. Noise is generally the data corruptions which may cause negative affect on the performance of a model built on the historical data. In this study, we propose an approach to filtering noise in the historical projects to improve the accuracy of ABE. We introduce and measure the Effort-Inconsistency Degree (EID), the degree that the effort of a historical project is inconsistent from those of its similar projects. Based on EID, we identify and filter the noise in terms of the inconsistent historical project data. We have validated the performance of ABE with our approach and three representative filtering techniques, namely the Edited Nearest Neighbor algorithm, the Univariate Outlier Elimination, and the Genetic Algorithm based project selection, on three software project datasets (Desharnais, Maxwell, and ISBSG (International Software Benchmarking Standards Group) Telecom). The experimental results suggest that our approach can improve the accuracy of ABE more effectively than can the other approaches.
  • Keywords
    genetic algorithms; pattern classification; project management; software management; ABE; analogy based effort estimation; data corruption; edited nearest neighbor algorithm; effort inconsistency degree; filtering noise; genetic algorithm; inconsistent historical project data; inconsistent software project data filtering; project management; project selection; representative filtering technique; software project dataset; univariate outlier elimination; Accuracy; Estimation; Filtering; Gallium; Genetic algorithms; Noise; Software; Effort-Inconsistency Degree; Software effort estimation; analogy-based estimation; inconsistent data; noise data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Software and Applications Conference (COMPSAC), 2010 IEEE 34th Annual
  • Conference_Location
    Seoul
  • ISSN
    0730-3157
  • Print_ISBN
    978-1-4244-7512-4
  • Electronic_ISBN
    0730-3157
  • Type

    conf

  • DOI
    10.1109/COMPSAC.2010.56
  • Filename
    5676301