• DocumentCode
    2345942
  • Title

    Identifying licensing of jar archives using a code-search approach

  • Author

    Di Penta, Massimiliano ; German, Daniel M. ; Antoniol, Giuliano

  • Author_Institution
    Dept. of Eng., Univ. of Sannio, Benevento, Italy
  • fYear
    2010
  • fDate
    2-3 May 2010
  • Firstpage
    151
  • Lastpage
    160
  • Abstract
    Free and open source software strongly promotes the reuse of source code. Some open source Java components/libraries are distributed as jar archives only containing the bytecode and some additional information. For whoever wanting to integrate this jar in her own project, it is important to determine the license(s) of the code from which the jar archive was produced, as this affects the way that such component can be used. This paper proposes an automatic approach to determine the license of jar archives, combining the use of a code-search engine with the automatic classification of licenses contained in textual flies enclosed in the jar. Results of an empirical study performed on 37 jars - from 17 different systems - indicate that this approach is able to successfully infer the jar licenses in over 95% of the cases, but that in many cases the license in textual flies may differ from the one of the classes contained in the jar.
  • Keywords
    Java; public domain software; search engines; text analysis; code-search approach; code-search engine; free software; jar archive license identification; open source Java components; open source Java libraries; open source software; textual flies; Computer science; Decoding; Documentation; Engines; Java; Licenses; Open source software; Software libraries; Uniform resource locators; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mining Software Repositories (MSR), 2010 7th IEEE Working Conference on
  • Conference_Location
    Cape Town
  • Print_ISBN
    978-1-4244-6802-7
  • Electronic_ISBN
    978-1-4244-6803-4
  • Type

    conf

  • DOI
    10.1109/MSR.2010.5463282
  • Filename
    5463282