Title :
Mining source code descriptions from developer communications
Author :
Panichella, Sebastiano ; Aponte, Jairo ; Penta, Massimiliano Di ; Marcus, Andrian ; Canfora, Gerardo
Author_Institution :
Dept. of Eng.-RCOST, Univ. of Sannio, Benevento, Italy
Abstract :
Very often, source code lacks comments that adequately describe its behavior. In such situations developers need to infer knowledge from the source code itself or to search for source code descriptions in external artifacts. We argue that messages exchanged among contributors/developers, in the form of bug reports and emails, are a useful source of information to help understanding source code. However, such communications are unstructured and usually not explicitly meant to describe specific parts of the source code. Developers searching for code descriptions within communications face the challenge of filtering large amount of data to extract what pieces of information are important to them. We propose an approach to automatically extract method descriptions from communications in bug tracking systems and mailing lists. We have evaluated the approach on bug reports and mailing lists from two open source systems (Lucene and Eclipse). The results indicate that mailing lists and bug reports contain relevant descriptions of about 36% of the methods from Lucene and 7% from Eclipse, and that the proposed approach is able to extract such descriptions with a precision of up to 79% for Eclipse and 87% for Lucene. The extracted method descriptions can help developers in understanding the code and could also be used as a starting point for source code re-documentation.
Keywords :
data mining; program compilers; program debugging; source coding; bug reports; bug tracking systems; code descriptions; developer communications; emails; external artifacts; information source; mailing lists; mining source code descriptions; open source systems; source code redocumentation; Context; Data mining; Educational institutions; Electronic mail; Filtering; Java; Syntactics; Code re-documentation; mining e-mails; program comprehension;
Conference_Titel :
Program Comprehension (ICPC), 2012 IEEE 20th International Conference on
Conference_Location :
Passau
Print_ISBN :
978-1-4673-1213-4
Electronic_ISBN :
1092-8138
DOI :
10.1109/ICPC.2012.6240510