DocumentCode :
243687
Title :
What Makes an Open Source Code Popular on Git Hub?
Author :
Weber, Simon ; Jiebo Luo
Author_Institution :
Dept. of Comput. Sci., Univ. of Rochester, Rochester, NY, USA
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
851
Lastpage :
855
Abstract :
The rise of social networks for software development has attached a notion of popularity to open source projects. This work attempts to extract knowledge from the differences between popular and unpopular Python projects on GitHub. A large set of projects was mined for a rich variety of features that measure language utilization, documentation, and code volume. These features were used to train a classifier which predicted current popularity well (F-score = ×8). Notably, these features outperformed measures of author popularity (F-score = ×7). However, these features did not strongly predict future growth in popularity. An in-depth analysis of the perform ant features revealed that they could be useful as a measure of not only popularity, but of code quality.
Keywords :
knowledge acquisition; public domain software; social networking (online); software engineering; source code (software); system documentation; GitHub; Python project; code quality; code volume; documentation; in-depth analysis; knowledge extraction; language utilization; open source code; open source project; social network; software development; Communities; Conferences; Current measurement; Data mining; Documentation; Feature extraction; Software; Git Hub; Python; open source software; popularity prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.55
Filename :
7022684
Link To Document :
بازگشت