DocumentCode :
2347163
Title :
Validity of network analyses in Open Source Projects
Author :
Nia, Roozbeh ; Bird, Christian ; Devanbu, Premkumar ; Filkov, Vladimir
Author_Institution :
Comput. Sci. Dept., Univ. of California, Davis, CA, USA
fYear :
2010
fDate :
2-3 May 2010
Firstpage :
201
Lastpage :
209
Abstract :
Social network methods are frequently used to analyze networks derived from Open Source Software (OSS) project communication and collaboration data. Such studies typically discover patterns in the information flow between contributors or contributions in these projects. Social network metrics have also been used to predict defect occurrence. However, such studies often ignore or side-step the issue of whether (and in what way) the metrics and networks under study are influenced by inadequate or missing data. In previous studies, email archives of OSS projects have provided a useful trace of the communication and co-ordination activities of the participants. These traces have been used to construct social networks that are then subjected to various types of analysis. However, during the construction of these networks, some assumptions are made that may not always hold; this leads to incomplete and sometimes incorrect networks. The question then becomes: do these errors affect the validity of the ensuing analysis? In this paper we specifically examine the stability of network metrics in the presence of inadequate and missing data. The issues that we study are: 1) the effect of paths with broken information flow (i.e., consecutive edges that are out of temporal order) on measures of centrality of nodes in the network, and 2) the effect of missing links on such measures. We demonstrate on three different OSS projects that while these issues do change network topology, the metrics used in the analysis are stable with respect to such changes.
Keywords :
public domain software; social networking (online); software metrics; network analysis validity; network metric stability; open source projects; social network methods; social network metrics; Birds; Collaboration; Communication networks; Computer science; Fluid flow measurement; Information analysis; Network topology; Social network services; Software engineering; Stability; Information Flow; Open Source; Social Networks;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Mining Software Repositories (MSR), 2010 7th IEEE Working Conference on
Conference_Location :
Cape Town
Print_ISBN :
978-1-4244-6802-7
Electronic_ISBN :
978-1-4244-6803-4
Type :
conf
DOI :
10.1109/MSR.2010.5463342
Filename :
5463342