DocumentCode
2600506
Title
Ecological inference in empirical software engineering
Author
Posnett, Daryl ; Filkov, Vladimir ; Devanbu, Premkumar
Author_Institution
Dept. of Comput. Sci., Univ. of California, Davis, CA, USA
fYear
2011
fDate
6-10 Nov. 2011
Firstpage
362
Lastpage
371
Abstract
Software systems are decomposed hierarchically, for example, into modules, packages and files. This hierarchical decomposition has a profound influence on evolvability, maintainability and work assignment. Hierarchical decomposition is thus clearly of central concern for empirical software engineering researchers; but it also poses a quandary. At what level do we study phenomena, such as quality, distribution, collaboration and productivity? At the level of files? packages? or modules? How does the level of study affect the truth, meaning, and relevance of the findings? In other fields it has been found that choosing the wrong level might lead to misleading or fallacious results. Choosing a proper level, for study, is thus vitally important for empirical software engineering research; but this issue hasn´t thus far been explicitly investigated. We describe the related idea of ecological inference and ecological fallacy from sociology and epidemiology, and explore its relevance to empirical software engineering; we also present some case studies, using defect and process data from 18 open source projects to illustrate the risks of modeling at an aggregation level in the context of defect prediction, as well as in hypothesis testing.
Keywords
diseases; ecology; inference mechanisms; social sciences; software packages; ecological fallacy; ecological inference; empirical software engineering; epidemiology; hierarchical decomposition; hypothesis testing; sociology; software modules; software packages; Biological system modeling; Data models; Measurement; Object oriented modeling; Predictive models; Software engineering; Testing;
fLanguage
English
Publisher
ieee
Conference_Titel
Automated Software Engineering (ASE), 2011 26th IEEE/ACM International Conference on
Conference_Location
Lawrence, KS
ISSN
1938-4300
Print_ISBN
978-1-4577-1638-6
Type
conf
DOI
10.1109/ASE.2011.6100074
Filename
6100074
Link To Document