• DocumentCode
    3326796
  • Title

    Analysis of Activity in the Open Source Software Development Community

  • Author

    Christley, Scott ; Madey, Greg

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Notre Dame Univ.
  • fYear
    2007
  • fDate
    Jan. 2007
  • Abstract
    Open source software is computer software for which the source code is publicly open for inspection, modification, and redistribution. While research on a few large, successful projects has provided insights into the nature and practices of the open source software community, it still leaves open the question of the thousands of other open source projects that are neither large nor highly successful. In this paper, we describe a data set of SourceForge.net, the world's largest open source software development site, which is available for research purposes; we discuss various data mining techniques that can be applied to the data and the types of research questions that can be answered. We apply a few of these techniques and provide analysis of the results.
  • Keywords
    Web sites; data mining; public domain software; very large databases; SourceForge.net open source software development Web site; computer software; data mining techniques; Computer bugs; Data mining; Databases; History; Information analysis; Inspection; Open source software; Programming; Project management; Writing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    System Sciences, 2007. HICSS 2007. 40th Annual Hawaii International Conference on
  • Conference_Location
    Waikoloa, HI
  • ISSN
    1530-1605
  • Electronic_ISBN
    1530-1605
  • Type

    conf

  • DOI
    10.1109/HICSS.2007.74
  • Filename
    4076711