• DocumentCode
    3587335
  • Title

    Outliers and Replication in Software Engineering

  • Author

    Larsson, Henrik ; Lindqvist, Erik ; Torkar, Richard

  • Author_Institution
    Chalmers & the Univ. of Gothenburg, Gothenburg, Sweden
  • Volume
    1
  • fYear
    2014
  • Firstpage
    207
  • Lastpage
    214
  • Abstract
    Empirical software engineering is a research field of growing interest. Studies within this field handles an increasing amount of data. In order to replicate a study the data needs to be accessible and all processing of this data needs to be reproducible. Specifically, the handling of deviating data points, also known as outliers, needs to be documented in order for a study to be replicated. This study investigated the data availability for recently published studies within empirical software engineering. Furthermore, it also investigated if outliers are documented in the same research field. Papers were reviewed using a literature review and the presence of outliers was investigated using an unsupervised outlier detection method. Only 37% of the papers reviewed had their data accessible. Furthermore, in many cases outliers were present in the reviewed studies but 63% of the papers studies did not mention how outliers were handled. The data availability within empirical software engineering research is low and is hindering replication of studies. Additionally, the lack of documentation regarding how outliers are handled is hindering replication.
  • Keywords
    data handling; software engineering; data availability; deviating data points handling; empirical software engineering research; unsupervised outlier detection method; Availability; Data collection; Detection algorithms; Estimation; Pipelines; Software; Software engineering; outliers; replication; reproducibility; software enigneering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering Conference (APSEC), 2014 21st Asia-Pacific
  • ISSN
    1530-1362
  • Print_ISBN
    978-1-4799-7425-2
  • Type

    conf

  • DOI
    10.1109/APSEC.2014.40
  • Filename
    7091312