• DocumentCode
    3142798
  • Title

    Discovering essential code elements in informal documentation

  • Author

    Rigby, Peter C. ; Robillard, Martin P.

  • Author_Institution
    Dept. of Software Eng., Concordia Univ., Montreal, QC, Canada
  • fYear
    2013
  • fDate
    18-26 May 2013
  • Firstpage
    832
  • Lastpage
    841
  • Abstract
    To access the knowledge contained in developer communication, such as forum posts, it is useful to determine automatically the code elements referred to in the discussions. We propose a novel traceability recovery approach to extract the code elements contained in various documents. As opposed to previous work, our approach does not require an index of code elements to find links, which makes it particularly well-suited for the analysis of informal documentation. When evaluated on 188 StackOverflow answer posts containing 993 code elements, the technique performs with average 0.92 precision and 0.90 recall. As a major refinement on traditional traceability approaches, we also propose to detect which of the code elements in a document are salient, or germane, to the topic of the post. To this end we developed a three-feature decision tree classifier that performs with a precision of 0.65-0.74 and recall of 0.30-0.65, depending on the subject of the document.
  • Keywords
    decision trees; software engineering; system recovery; StackOverflow answer posts; essential code elements; informal documentation; three-feature decision tree classifier; traceability recovery approach; Benchmark testing; Compounds; Context; Documentation; Indexes; Information retrieval; Java;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering (ICSE), 2013 35th International Conference on
  • Conference_Location
    San Francisco, CA
  • Print_ISBN
    978-1-4673-3073-2
  • Type

    conf

  • DOI
    10.1109/ICSE.2013.6606629
  • Filename
    6606629