DocumentCode
3142798
Title
Discovering essential code elements in informal documentation
Author
Rigby, Peter C. ; Robillard, Martin P.
Author_Institution
Dept. of Software Eng., Concordia Univ., Montreal, QC, Canada
fYear
2013
fDate
18-26 May 2013
Firstpage
832
Lastpage
841
Abstract
To access the knowledge contained in developer communication, such as forum posts, it is useful to determine automatically the code elements referred to in the discussions. We propose a novel traceability recovery approach to extract the code elements contained in various documents. As opposed to previous work, our approach does not require an index of code elements to find links, which makes it particularly well suited for the analysis of informal documentation. When evaluated on 188 StackOverflow answer posts containing 993 code elements, the technique achieves, on average, 0.92 precision and 0.90 recall. As a major refinement on traditional traceability approaches, we also propose to detect which of the code elements in a document are salient, or germane, to the topic of the post. To this end, we developed a three-feature decision tree classifier that performs with a precision of 0.65-0.74 and a recall of 0.30-0.65, depending on the subject of the document.
Keywords
decision trees; software engineering; system recovery; StackOverflow answer posts; essential code elements; informal documentation; three-feature decision tree classifier; traceability recovery approach; Benchmark testing; Compounds; Context; Documentation; Indexes; Information retrieval; Java
fLanguage
English
Publisher
IEEE
Conference_Titel
Software Engineering (ICSE), 2013 35th International Conference on
Conference_Location
San Francisco, CA
Print_ISBN
978-1-4673-3073-2
Type
conf
DOI
10.1109/ICSE.2013.6606629
Filename
6606629