• DocumentCode
    2417540
  • Title

    The Commit Size Distribution of Open Source Software

  • Author

    Arafat, Omar ; Riehle, Dirk

  • Author_Institution
    SAP Res., SAP Labs. LLC, Palo Alto, CA
  • fYear
    2009
  • fDate
    5-8 Jan. 2009
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    With the growing economic importance of open source, we need to improve our understanding of how open source software development processes work. The analysis of code contributions to open source projects is an important part of such research. In this paper we analyze the size of code contributions to more than 9,000 open source projects. We review the total distribution and distinguish three categories of code contributions using a size-based heuristic: single focused commits, aggregate team contributions, and repository refactorings. We find that both the overall distribution and the individual categories follow a power law. We also suggest that distinguishing these commit categories by size will benefit future analyses.
  • Keywords
    economics; public domain software; software maintenance; commit size distribution; economic; open source software development processes; repository refactorings; Aggregates; Application software; Best practices; Computer industry; Embedded software; History; Open source software; Power generation economics; Programming; Relational databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System Sciences, 2009. HICSS '09. 42nd Hawaii International Conference on
  • Conference_Location
    Big Island, HI
  • ISSN
    1530-1605
  • Print_ISBN
    978-0-7695-3450-3
  • Type

    conf

  • DOI
    10.1109/HICSS.2009.421
  • Filename
    4755633