• DocumentCode
    3248622
  • Title

    Using code ownership to improve IR-based Traceability Link Recovery

  • Author

    Diaz, David ; Bavota, Gabriele ; Marcus, Andrian ; Oliveto, Rocco ; Takahashi, Satoshi ; De Lucia, Andrea

  • Author_Institution
    Univ. de Los Andes, Bogota, Colombia
  • fYear
    2013
  • fDate
    20-21 May 2013
  • Firstpage
    123
  • Lastpage
    132
  • Abstract
    Information Retrieval (IR) techniques have gained widespread acceptance as a method for automating traceability recovery. These techniques recover links between software artifacts based on their textual similarity, i.e., the higher the similarity, the higher the likelihood that there is a link between the two artifacts. A common problem with all IR-based techniques is filtering out noise from the list of candidate links, in order to improve the recovery accuracy. Indeed, software artifacts may be related in many ways and the textual information captures only one aspect of their relationships. In this paper, we propose to leverage code ownership information to capture relationships between source code artifacts for improving the recovery of traceability links between documentation and source code. Specifically, we extract the author of each source code component and for each author we identify the “context” she worked on. Thus, for a given query from the external documentation, we compute the similarity between it and the context of the authors. When retrieving classes that relate to a specific query using a standard IR-based approach, we reward all the classes developed by the authors having their context most similar to the query, by boosting their similarity to the query. The proposed approach, named TYRION (TraceabilitY link Recovery using Information retrieval and code OwNership), has been instantiated for the recovery of traceability links between use cases and Java classes of two software systems. The results indicate that code ownership information can be used to improve the accuracy of an IR-based traceability link recovery technique.
  • Keywords
    Java; information retrieval; software maintenance; system documentation; text analysis; IR technique; IR-based traceability link recovery improvement; Java classes; TYRION; TraceabilitY link Recovery using Information retrieval and code OwNership; class retrieval; code ownership information; documentation; noise filtering; software artifacts; software system; source code component; textual information; textual similarity; Accuracy; Context; Java; Software systems; Standards; Vocabulary; Code Ownership; Empirical Studies; Information Retrieval; Traceability Link Recovery;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Program Comprehension (ICPC), 2013 IEEE 21st International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1063-6897
  • Type

    conf

  • DOI
    10.1109/ICPC.2013.6613840
  • Filename
    6613840