• DocumentCode
    181859
  • Title
    Modeling Changeset Topics
  • Author
    Corley, Christopher S.; Kashuda, Kelly L.; May, Daniel S.; Kraft, Nicholas A.
  • Author_Institution
    Univ. of Alabama, Tuscaloosa, AL, USA
  • fYear
    2014
  • fDate
    30 Sept. 2014
  • Firstpage
    6
  • Lastpage
    10
  • Abstract
    Topic modeling has been applied to several areas of software engineering, such as bug localization, feature location, triaging change requests, and traceability link recovery. Many of these approaches combine mining unstructured data, such as bug reports, with topic modeling a snapshot (or release) of source code. However, source code evolves, which causes models to become obsolete. In this paper, we explore the approach of topic modeling changesets over the traditional release approach. We conduct an exploratory study of four open source systems. We investigate the differences in corpora in each project, and evaluate the topic distinctness of the models.
  • Keywords
    data mining; program debugging; public domain software; software engineering; bug localization; bug reports; changeset topics modeling; feature location; open source systems; source code; topic distinctness evaluation; traceability link recovery; triaging change requests; unstructured data mining; Data mining; Data models; History; Java; Resource management; Software maintenance; Mining software repositories; changesets; latent Dirichlet allocation; topic modeling
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Mining Unstructured Data (MUD), 2014 IEEE 4th Workshop on
  • Conference_Location
    Victoria, BC
  • Type
    conf
  • DOI
    10.1109/MUD.2014.9
  • Filename
    6980188