• DocumentCode
    3112402
  • Title

    Concept to commit: A pattern designed to trace code changes from user requests to change implementation by analyzing mailing lists and code repositories

  • Author

    McGrath, Sean ; Bastola, Kiran ; Siy, Harvey

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Univ. of Nebraska at Omaha, Omaha, NE, USA
  • fYear
    2013
  • fDate
    21-21 May 2013
  • Firstpage
    6
  • Lastpage
    8
  • Abstract
    The concept to commit pattern is used for tracing code changes from user requests (analyzing the mailing list) to change implementation (analyzing the code repository). The analysis is done via text mining of both emails and commits descriptions in 4 stages. The first stage is identifying a search time window for the mailing list by evaluating a targeted commit time stamp. Once a window is established, the body of the mailing list is reduced to match the search window. The next stage involves basic text mining processing (tokenization, stemming, and document matrix creation). The final step is to perform frequency analysis (word cloud, heat map, or dendrogram).
  • Keywords
    data mining; electronic mail; mailing systems; search problems; text analysis; change implementation; code change tracing; code repository analysis; commit time stamp; concept to commit pattern; dendrogram; document matrix creation; frequency analysis; heat map; mailing list analysis; search time window; stemming; text mining; tokenization; user requests; word cloud; Bioinformatics; Communities; Electronic mail; Indexes; Software; Text mining; Code Repositories; Data Mining; Mailing lists; R;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Analysis Patterns in Software Engineering (DAPSE), 2013 1st International Workshop on
  • Conference_Location
    San Francisco, CA
  • Type

    conf

  • DOI
    10.1109/DAPSE.2013.6603799
  • Filename
    6603799