• DocumentCode
    3406901
  • Title

    The Eclipse and Mozilla defect tracking dataset: A genuine dataset for mining bug information

  • Author

    Lamkanfi, A. ; Perez, J.M. ; Demeyer, S.

  • Author_Institution
    Univ. of Antwerp, Antwerp, Belgium
  • fYear
    2013
  • fDate
    18-19 May 2013
  • Firstpage
    203
  • Lastpage
    206
  • Abstract
    The analysis of bug reports is an important subfield within the mining software repositories community. It explores the rich data available in defect tracking systems to uncover interesting and actionable information about the bug triaging process. While bug data is readily accessible from systems like Bugzilla and JIRA, a common database schema and a curated dataset could significantly enhance future research because it allows for easier replication. Consequently, in this paper we propose the Eclipse and Mozilla Defect Tracking Dataset, a representative database of bug data, filtered to contain only genuine defects (i.e., no feature requests) and designed to cover the whole bug-triage life cycle (i.e., store all intermediate actions). We have used this dataset ourselves for predicting bug severity, for studying bug-fixing time and for identifying erroneously assigned components. Sharing these data with the rest of the community will allow for reproducibility, validation and comparison of the results obtained in bug-report analyses and experiments.
  • Keywords
    data mining; information filtering; program debugging; Eclipse; Mozilla; bug data database; bug information; bug severity prediction; bug-fixing time; bug-report analyses; bug-triage life cycle; data filtering; defect tracking dataset; defect tracking systems; erroneously assigned component identification; mining software repositories community; Communities; Computer bugs; Data mining; Databases; Software; Software engineering; XML; Bug reports; Dataset; Defect Tracking;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Mining Software Repositories (MSR), 2013 10th IEEE Working Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    2160-1852
  • Print_ISBN
    978-1-4799-0345-0
  • Type

    conf

  • DOI
    10.1109/MSR.2013.6624028
  • Filename
    6624028