DocumentCode
3248622
Title
Using code ownership to improve IR-based Traceability Link Recovery
Author
Diaz, David ; Bavota, Gabriele ; Marcus, Andrian ; Oliveto, Rocco ; Takahashi, Satoshi ; De Lucia, Andrea
Author_Institution
Univ. de Los Andes, Bogota, Colombia
fYear
2013
fDate
20-21 May 2013
Firstpage
123
Lastpage
132
Abstract
Information Retrieval (IR) techniques have gained widespread acceptance as a method for automating traceability recovery. These techniques recover links between software artifacts based on their textual similarity, i.e., the higher the similarity, the higher the likelihood that there is a link between the two artifacts. A common problem with all IR-based techniques is filtering out noise from the list of candidate links in order to improve recovery accuracy. Indeed, software artifacts may be related in many ways, and textual information captures only one aspect of their relationships. In this paper, we propose to leverage code ownership information to capture relationships between source code artifacts, with the goal of improving the recovery of traceability links between documentation and source code. Specifically, we extract the author of each source code component and, for each author, identify the “context” she worked on. Then, for a given query from the external documentation, we compute the similarity between the query and the context of each author. When retrieving classes that relate to a specific query using a standard IR-based approach, we reward all the classes developed by the authors whose context is most similar to the query by boosting their similarity to the query. The proposed approach, named TYRION (TraceabilitY link Recovery using Information retrieval and code OwNership), has been instantiated for the recovery of traceability links between use cases and Java classes of two software systems. The results indicate that code ownership information can be used to improve the accuracy of an IR-based traceability link recovery technique.
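The ranking idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not specify the IR model, how an author's context is built, or how the boost is applied, so the following are all assumptions made for illustration: plain term-frequency cosine similarity, a context formed by pooling the text of every class an author wrote, a multiplicative bonus, and the hypothetical names `rank_with_ownership`, `top_authors`, and `bonus`.

```python
import math
import re
from collections import Counter


def tokens(text):
    """Lowercased alphabetic tokens; a stand-in for real IR preprocessing."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank_with_ownership(query, classes, top_authors=1, bonus=0.5):
    """Rank classes for a query, rewarding classes of the best-matching authors.

    classes maps class name -> (author, text).
    Returns a list of (class name, score) pairs, best first.
    """
    q = Counter(tokens(query))

    # Baseline: textual similarity between the query and each class.
    base = {
        name: cosine(q, Counter(tokens(text)))
        for name, (_, text) in classes.items()
    }

    # Author context (assumption): pooled text of all classes the author wrote.
    context = {}
    for author, text in classes.values():
        context.setdefault(author, Counter()).update(tokens(text))

    # Authors whose context is most similar to the query.
    by_similarity = sorted(
        context, key=lambda a: cosine(q, context[a]), reverse=True
    )
    rewarded = set(by_similarity[:top_authors])

    # Boost classes written by rewarded authors (multiplicative bonus is an
    # assumption; boosted scores may exceed 1.0).
    scores = {
        name: base[name] * (1 + bonus) if classes[name][0] in rewarded else base[name]
        for name in classes
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Calling `rank_with_ownership(query, classes)` with `bonus=0.0` reproduces the plain textual ranking, which makes it easy to compare the candidate list with and without the ownership reward.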
Keywords
Java; information retrieval; software maintenance; system documentation; text analysis; IR technique; IR-based traceability link recovery improvement; Java classes; TYRION; TraceabilitY link Recovery using Information retrieval and code OwNership; class retrieval; code ownership information; documentation; noise filtering; software artifacts; software system; source code component; textual information; textual similarity; Accuracy; Context; Java; Software systems; Standards; Vocabulary; Code Ownership; Empirical Studies; Information Retrieval; Traceability Link Recovery;
fLanguage
English
Publisher
IEEE
Conference_Titel
Program Comprehension (ICPC), 2013 IEEE 21st International Conference on
Conference_Location
San Francisco, CA
ISSN
1063-6897
Type
conf
DOI
10.1109/ICPC.2013.6613840
Filename
6613840