DocumentCode
181859
Title
Modeling Changeset Topics
Author
Corley, Christopher S. ; Kashuda, Kelly L. ; May, Daniel S. ; Kraft, Nicholas A.
Author_Institution
Univ. of Alabama, Tuscaloosa, AL, USA
fYear
2014
fDate
30-30 Sept. 2014
Firstpage
6
Lastpage
10
Abstract
Topic modeling has been applied to several areas of software engineering, such as bug localization, feature location, triaging change requests, and traceability link recovery. Many of these approaches combine mining unstructured data, such as bug reports, with topic modeling a snapshot (or release) of source code. However, source code evolves, which causes models to become obsolete. In this paper, we explore the approach of topic modeling changesets over the traditional release approach. We conduct an exploratory study of four open source systems. We investigate the differences in corpora in each project, and evaluate the topic distinctness of the models.
Keywords
data mining; program debugging; public domain software; software engineering; bug localization; bug reports; changeset topics modeling; feature location; open source systems; software engineering; source code; topic distinctness evaluation; traceability link recovery; triaging change requests; unstructured data mining; Data mining; Data models; History; Java; Resource management; Software maintenance; Mining software repositories; changesets; latent Dirichlet allocation; topic modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Mining Unstructured Data (MUD), 2014 IEEE 4th Workshop on
Conference_Location
Victoria, BC
Type
conf
DOI
10.1109/MUD.2014.9
Filename
6980188
Link To Document