• DocumentCode
    2075061
  • Title
    Linking e-mails and source code artifacts
  • Author
    Bacchelli, Alberto; Lanza, Michele; Robbes, Romain
  • Volume
    1
  • fYear
    2010
  • fDate
    2-8 May 2010
  • Firstpage
    375
  • Lastpage
    384
  • Abstract
    E-mails concerning the development issues of a system constitute an important source of information about high-level design decisions, low-level implementation concerns, and the social structure of developers. Establishing links between e-mails and the software artifacts they discuss is a non-trivial problem, due to the inherently informal nature of human communication. Different approaches can be brought into play to tackle this traceability issue, but the question of how they can be evaluated remains unaddressed, as there is no recognized benchmark against which they can be compared. In this article we present such a benchmark, which we created through the manual inspection of a statistically significant number of e-mails pertaining to six unrelated software systems. We then use our benchmark to measure the effectiveness of a number of approaches, ranging from lightweight approaches based on regular expressions to full-fledged information retrieval approaches.
  • Keywords
    electronic mail; information retrieval; program diagnostics; software maintenance; e-mails; information retrieval approaches; software artifacts; source code artifacts; traceability issue; unrelated software systems; Benchmark testing; Electronic mail; Java; Joining processes; Large scale integration; Object oriented modeling; Software systems
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Engineering, 2010 ACM/IEEE 32nd International Conference on
  • Conference_Location
    Cape Town
  • ISSN
    0270-5257
  • Print_ISBN
    978-1-60558-719-6
  • Type
    conf
  • DOI
    10.1145/1806799.1806855
  • Filename
    6062105