Title :
Quantitative Determination of the Relationship between Internal Validity and Bias in Software Engineering Experiments: Consequences for Systematic Literature Reviews
Author :
Dieste, Oscar ; Griman, Anna ; Juristo, Natalia ; Saxena, Himanshu
Author_Institution :
Fac. de Inf., Univ. Politec. de Madrid, Madrid, Spain
Abstract :
Quality assessment is one of the activities performed as part of systematic literature reviews. It is commonly accepted that a good quality experiment is bias free. Bias is considered to be related to internal validity (e.g., how adequately the experiment is planned, executed and analysed). Quality assessment is usually conducted using checklists and quality scales. It has not yet been proven, however, that quality is related to experimental bias. Aim: Identify whether there is a relationship between internal validity and bias in software engineering experiments. Method: We built a quality scale to determine the quality of the studies, which we applied to 28 experiments included in two systematic literature reviews. We proposed an objective indicator of experimental bias, which we applied to the same 28 experiments. Finally, we analysed the correlations between the quality scores and the proposed measure of bias. Results: We failed to find a relationship between the global quality score (resulting from the quality scale) and bias, however, we did identify interesting correlations between bias and some particular aspects of internal validity measured by the instrument. Conclusions: There is an empirically provable relationship between internal validity and bias. It is feasible to apply quality assessment in systematic literature reviews, subject to limits on the internal validity aspects for consideration.
Keywords :
software engineering; internal validity; quality assessment; quantitative determination; software engineering experiments; systematic literature reviews; Context; Correlation; Inspection; Instruments; Quality assessment; Software engineering; Systematics; Checklist; Quality Assessment (QA) of experiments; Quality Scale; Systematic Literature Review (SLR);
Conference_Titel :
Empirical Software Engineering and Measurement (ESEM), 2011 International Symposium on
Conference_Location :
Banff, AB
Print_ISBN :
978-1-4577-2203-5
DOI :
10.1109/ESEM.2011.37