• DocumentCode
    2600110
  • Title
    Improving automated documentation to code traceability by combining retrieval techniques
  • Author
    Chen, Xiaofan ; Grundy, John
  • Author_Institution
    Dept. of Comput. Sci., Univ. of Auckland, Auckland, New Zealand
  • fYear
    2011
  • fDate
    6-10 Nov. 2011
  • Firstpage
    223
  • Lastpage
    232
  • Abstract
    Documentation written in natural language and source code are two of the major artifacts of a software system. Tracking a variety of traceability links between software documentation and source code assists software developers in comprehension, efficient development, and effective management of a system. Automated traceability systems to date face a major open research challenge: how to extract these links with both high precision and high recall. In this paper, we introduce an approach that combines three supporting techniques, Regular Expression, Key Phrases, and Clustering, with a Vector Space Model (VSM) to improve the performance of automated traceability between documents and source code. This combined approach takes advantage of the strengths of the three techniques to ameliorate the limitations of VSM. We evaluate the combined approach on four case studies. Experimental results indicate that our approach improves the performance of VSM, increases the precision of retrieved links, and recovers more true links than VSM alone.
  • Keywords
    information retrieval; natural languages; pattern clustering; program diagnostics; source coding; system documentation; automated code traceability systems; automated software documentation; natural language; regular expression; retrieval techniques; source code; vector space model; Clustering algorithms; Documentation; Engines; Software systems; Unified modeling language; Vectors; Clustering; Key Phrases; Regular Expression; Traceability; Vector Space Model
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Automated Software Engineering (ASE), 2011 26th IEEE/ACM International Conference on
  • Conference_Location
    Lawrence, KS
  • ISSN
    1938-4300
  • Print_ISBN
    978-1-4577-1638-6
  • Type
    conf
  • DOI
    10.1109/ASE.2011.6100057
  • Filename
    6100057