• DocumentCode
    2211320
  • Title

    An empirical study on the risks of using off-the-shelf techniques for processing mailing list data

  • Author

    Bettenburg, Nicolas ; Shihab, Emad ; Hassan, Ahmed E.

  • Author_Institution
    Software Anal. & Intell. Lab., Queen´´s Univ., Kingston, ON, Canada
  • fYear
    2009
  • fDate
    20-26 Sept. 2009
  • Firstpage
    539
  • Lastpage
    542
  • Abstract
    Mailing list repositories contain valuable information about the history of a project. Research is starting to mine this information to support developers and maintainers of long-lived software projects. However, such information exists as unstructured data that needs special processing before it can be studied. In this paper, we identify several challenges that arise when using off-the-shelf techniques for processing mailing list data. Our study highlights the importance of proper processing of mailing list data to ensure accurate research results.
  • Keywords
    electronic mail; software maintenance; electronic mail; mailing list data processing; mailing list repositories; off-the-shelf techniques; unstructured data; Computer networks; Data mining; Electronic mail; File servers; History; Information analysis; Risk analysis; Software maintenance; Tag clouds; Yarn;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance, 2009. ICSM 2009. IEEE International Conference on
  • Conference_Location
    Edmonton, AB
  • ISSN
    1063-6773
  • Print_ISBN
    978-1-4244-4897-5
  • Electronic_ISBN
    1063-6773
  • Type

    conf

  • DOI
    10.1109/ICSM.2009.5306383
  • Filename
    5306383