• DocumentCode
    2209597
  • Title
    What's hot and what's not: Windowed developer topic analysis
  • Author
    Hindle, Abram ; Godfrey, Michael W. ; Holt, Richard C.
  • Author_Institution
    Univ. of Waterloo, Waterloo, ON, Canada
  • fYear
    2009
  • fDate
    20-26 Sept. 2009
  • Firstpage
    339
  • Lastpage
    348
  • Abstract
    As development on a software project progresses, developers shift their focus between different topics and tasks many times. Managers and newcomer developers often seek ways of understanding what tasks have recently been worked on and how much effort has gone into each; for example, a manager might wonder what unexpected tasks occupied their team's attention during a period when they were supposed to have been implementing new features. Tools such as Latent Dirichlet Allocation (LDA) and Latent Semantic Indexing (LSI) can be used to extract a set of independent topics from a corpus of commit-log comments. Previous work in the area has created a single set of topics by analyzing comments from the entire lifetime of the project. In this paper, we propose windowing the topic analysis to give a more nuanced view of the system's evolution. By using a defined time-window of, for example, one month, we can track which topics come and go over time, and which ones recur. We propose visualizations of this model that allow us to explore the evolving stream of topics of development occurring over time. We demonstrate that windowed topic analysis offers advantages over topic analysis applied to a project's lifetime because many topics are quite local.
  • Keywords
    program visualisation; programming language semantics; software engineering; word processing; commit-log comments; latent Dirichlet allocation tool; latent semantic indexing tool; software developers; software project development; windowed topic analysis; Control systems; Database systems; History; Indexing; Large scale integration; Linear discriminant analysis; Pattern analysis; Performance analysis; Software development management; Visualization
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Maintenance, 2009. ICSM 2009. IEEE International Conference on
  • Conference_Location
    Edmonton, AB
  • ISSN
    1063-6773
  • Print_ISBN
    978-1-4244-4897-5
  • Electronic_ISBN
    1063-6773
  • Type
    conf
  • DOI
    10.1109/ICSM.2009.5306310
  • Filename
    5306310