DocumentCode :
3442292
Title :
How Much Do Code Repositories Include Peripheral Modifications?
Author :
Kusunoki, Noa ; Hotta, Kazuhiro ; Higo, Y. ; Kusumoto, Shinji
Author_Institution :
Grad. Sch. of Inf. Sci. & Technol., Osaka Univ., Suita, Japan
Volume :
2
fYear :
2013
fDate :
2-5 Dec. 2013
Firstpage :
19
Lastpage :
24
Abstract :
In the last decade, a variety of studies on mining software repositories has been conducted. Mining repositories has a potential to obtain useful knowledge for the future development and maintenance. When software repositories are mined, large commits in them are often excluded from mining targets because large commits include merging and we believe that large commits include peripheral modifications, which may affect negative impacts on mining code repositories. However, if large commits include code modifications, excluding large commits loses such modifications unintentionally. Moreover, such data cleansing assumes that there are no peripheral modifications in small commits. In this paper, we investigate how much peripheral modifications are included in commits in code repositories. As a result, we found that excluding large commits is insufficient to remove hindrances in commits for mining code repositories.
Keywords :
data mining; software maintenance; code modifications; code repositories; peripheral modifications; software development; software maintenance; software repositories mining; Data mining; Educational institutions; Maintenance engineering; Merging; Software; Software engineering; White spaces; Large commit; Mining software repositories; Source code analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Software Engineering Conference (APSEC), 2013 20th Asia-Pacific
Conference_Location :
Bangkok
ISSN :
1530-1362
Print_ISBN :
978-1-4799-2143-0
Type :
conf
DOI :
10.1109/APSEC.2013.106
Filename :
6754345
Link To Document :
بازگشت