DocumentCode
731534
Title
Dataset of Developer-Labeled Commit Messages
Author
Mauczka, Andreas ; Brosch, Florian ; Schanes, Christian ; Grechenig, Thomas
Author_Institution
Inst. of Ind. Software, Vienna Univ. of Technol., Vienna, Austria
fYear
2015
fDate
16-17 May 2015
Firstpage
490
Lastpage
493
Abstract
Current research on change classification centers around automated and semi-automated approaches which are based on evaluation by either the researchers themselves or external experts. In most cases, the persons evaluating the effectiveness of the classification schemes are not the authors of the original changes and therefore can only make assumptions about the intent of the changes. To support validation of existing labeling mechanisms and to provide a training set for future approaches, we present a survey of source code changes that were labeled by their original authors. Seven developers from six different project applied three existing classification schemes from current literature to enrich their own changes with meta-information, so the intent of the changes becomes more evident. The final data set consists of 967 classified changes and is available as an SQLite database as part of the MSR data set.
Keywords
pattern classification; SQLite database; classification scheme; developer-labeled commit messages dataset; source code changes; Data mining; Data models; Databases; Labeling; Maintenance engineering; Usability;
fLanguage
English
Publisher
ieee
Conference_Titel
Mining Software Repositories (MSR), 2015 IEEE/ACM 12th Working Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/MSR.2015.71
Filename
7180125
Link To Document