DocumentCode :
181859
Title :
Modeling Changeset Topics
Author :
Corley, Christopher S. ; Kashuda, Kelly L. ; May, Daniel S. ; Kraft, Nicholas A.
Author_Institution :
Univ. of Alabama, Tuscaloosa, AL, USA
fYear :
2014
fDate :
30 Sept. 2014
Firstpage :
6
Lastpage :
10
Abstract :
Topic modeling has been applied to several areas of software engineering, such as bug localization, feature location, triaging change requests, and traceability link recovery. Many of these approaches combine mining unstructured data, such as bug reports, with topic modeling a snapshot (or release) of source code. However, source code evolves, which causes models to become obsolete. In this paper, we explore topic modeling changesets as an alternative to the traditional release-based approach. We conduct an exploratory study of four open source systems. We investigate the differences between the corpora in each project, and evaluate the topic distinctness of the models.
Keywords :
data mining; program debugging; public domain software; software engineering; bug localization; bug reports; changeset topics modeling; feature location; open source systems; source code; topic distinctness evaluation; traceability link recovery; triaging change requests; unstructured data mining; Data models; History; Java; Resource management; Software maintenance; Mining software repositories; changesets; latent Dirichlet allocation; topic modeling
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Mining Unstructured Data (MUD), 2014 IEEE 4th Workshop on
Conference_Location :
Victoria, BC
Type :
conf
DOI :
10.1109/MUD.2014.9
Filename :
6980188