Title :
How Much Do Code Repositories Include Peripheral Modifications?
Author :
Kusunoki, Noa ; Hotta, Kazuhiro ; Higo, Y. ; Kusumoto, Shinji
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
Abstract :
In the last decade, a variety of studies on mining software repositories has been conducted. Mining repositories has a potential to obtain useful knowledge for the future development and maintenance. When software repositories are mined, large commits in them are often excluded from mining targets because large commits include merging and we believe that large commits include peripheral modifications, which may affect negative impacts on mining code repositories. However, if large commits include code modifications, excluding large commits loses such modifications unintentionally. Moreover, such data cleansing assumes that there are no peripheral modifications in small commits. In this paper, we investigate how much peripheral modifications are included in commits in code repositories. As a result, we found that excluding large commits is insufficient to remove hindrances in commits for mining code repositories.
Keywords :
data mining; software maintenance; code modifications; code repositories; peripheral modifications; software development; software maintenance; software repositories mining; Data mining; Educational institutions; Maintenance engineering; Merging; Software; Software engineering; White spaces; Large commit; Mining software repositories; Source code analysis;
Conference_Titel :
Software Engineering Conference (APSEC), 2013 20th Asia-Pacific
Conference_Location :
Bangkok
Print_ISBN :
978-1-4799-2143-0
DOI :
10.1109/APSEC.2013.106