DocumentCode
3112402
Title
Concept to commit: A pattern designed to trace code changes from user requests to change implementation by analyzing mailing lists and code repositories
Author
McGrath, Sean ; Bastola, Kiran ; Siy, Harvey
Author_Institution
Coll. of Inf. Sci. & Technol., Univ. of Nebraska at Omaha, Omaha, NE, USA
fYear
2013
fDate
21-21 May 2013
Firstpage
6
Lastpage
8
Abstract
The concept to commit pattern is used for tracing code changes from user requests (analyzing the mailing list) to change implementation (analyzing the code repository). The analysis is done via text mining of both emails and commits descriptions in 4 stages. The first stage is identifying a search time window for the mailing list by evaluating a targeted commit time stamp. Once a window is established, the body of the mailing list is reduced to match the search window. The next stage involves basic text mining processing (tokenization, stemming, and document matrix creation). The final step is to perform frequency analysis (word cloud, heat map, or dendrogram).
Keywords
data mining; electronic mail; mailing systems; search problems; text analysis; change implementation; code change tracing; code repository analysis; commit time stamp; concept to commit pattern; dendrogram; document matrix creation; frequency analysis; heat map; mailing list analysis; search time window; stemming; text mining; tokenization; user requests; word cloud; Bioinformatics; Communities; Electronic mail; Indexes; Software; Text mining; Code Repositories; Data Mining; Mailing lists; R;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Analysis Patterns in Software Engineering (DAPSE), 2013 1st International Workshop on
Conference_Location
San Francisco, CA
Type
conf
DOI
10.1109/DAPSE.2013.6603799
Filename
6603799
Link To Document