Title :
Discussion of a Large-Scale Open Source Data Collection Methodology
Author :
Hahsler, Michael ; Koch, Stefan
Author_Institution :
Vienna University of Economics and BA
Abstract :
This paper discusses in detail a possible methodology for collecting repository data on a large number of open source software projects from a single project hosting and community site. The process of data retrieval is described along with the possible metrics that can be computed and which can be used for further analyses. Example research areas to be addressed with the available data and first results are given. Then, both advantages and disadvantages of the proposed methodology are discussed together with implications for future approaches.
Keywords :
Collaborative software; Collaborative work; Information retrieval; Large-scale systems; Law; Licenses; Linux; Open source software; Programming; Web server;
Conference_Titel :
System Sciences, 2005. HICSS '05. Proceedings of the 38th Annual Hawaii International Conference on
Print_ISBN :
0-7695-2268-8
DOI :
10.1109/HICSS.2005.204