• DocumentCode
    2600506
  • Title

    Ecological inference in empirical software engineering

  • Author

    Posnett, Daryl ; Filkov, Vladimir ; Devanbu, Premkumar

  • Author_Institution
    Dept. of Comput. Sci., Univ. of California, Davis, CA, USA
  • fYear
    2011
  • fDate
    6-10 Nov. 2011
  • Firstpage
    362
  • Lastpage
    371
  • Abstract
    Software systems are decomposed hierarchically, for example, into modules, packages and files. This hierarchical decomposition has a profound influence on evolvability, maintainability and work assignment. Hierarchical decomposition is thus clearly of central concern for empirical software engineering researchers; but it also poses a quandary. At what level do we study phenomena, such as quality, distribution, collaboration and productivity? At the level of files? packages? or modules? How does the level of study affect the truth, meaning, and relevance of the findings? In other fields it has been found that choosing the wrong level might lead to misleading or fallacious results. Choosing a proper level, for study, is thus vitally important for empirical software engineering research; but this issue hasn´t thus far been explicitly investigated. We describe the related idea of ecological inference and ecological fallacy from sociology and epidemiology, and explore its relevance to empirical software engineering; we also present some case studies, using defect and process data from 18 open source projects to illustrate the risks of modeling at an aggregation level in the context of defect prediction, as well as in hypothesis testing.
  • Keywords
    diseases; ecology; inference mechanisms; social sciences; software packages; ecological fallacy; ecological inference; empirical software engineering; epidemiology; hierarchical decomposition; hypothesis testing; sociology; software modules; software packages; Biological system modeling; Data models; Measurement; Object oriented modeling; Predictive models; Software engineering; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automated Software Engineering (ASE), 2011 26th IEEE/ACM International Conference on
  • Conference_Location
    Lawrence, KS
  • ISSN
    1938-4300
  • Print_ISBN
    978-1-4577-1638-6
  • Type

    conf

  • DOI
    10.1109/ASE.2011.6100074
  • Filename
    6100074