• DocumentCode
    2727305
  • Title

    Duplication Detection for Software Bug Reports Based on BM25 Term Weighting

  • Author

    Cheng-Zen Yang ; Hung-Hsueh Du ; Sin-Sian Wu ; Ing-Xiang Chen

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Yuan Ze Univ., Chungli, Taiwan
  • fYear
    2012
  • fDate
    16-18 Nov. 2012
  • Firstpage
    33
  • Lastpage
    38
  • Abstract
    Handling bug reports is an important issue in software maintenance. Recently, detection on duplicate bug reports has received much attention. There are two main reasons. First, duplicate bug reports may waste human resource to process these redundant reports. Second, duplicate bug reports may provide abundant information for further software maintenance. In the past studies, many schemes have been proposed using the information retrieval and natural language processing techniques. In this thesis, we propose a novel detection scheme based on a BM25 term weighting scheme. We have conducted empirical experiments on three open source projects, Apache, ArgoUML, and SVN. The experimental results show that the BM25-based scheme can effectively improve the detection performance in nearly all cases.
  • Keywords
    Unified Modeling Language; program debugging; program testing; public domain software; software maintenance; software quality; Apache; ArgoUML; BM25 term weighting scheme; SVN; detection performance; duplicate bug report detection; duplication detection; human resource wastage; information retrieval; natural language processing techniques; open source projects; software bug report handling; software maintenance; Atmospheric measurements; Information retrieval; Natural language processing; Particle measurements; Software; Unified modeling language; Vectors; BM25; bug reports; duplication detection; term weighting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Technologies and Applications of Artificial Intelligence (TAAI), 2012 Conference on
  • Conference_Location
    Tainan
  • Print_ISBN
    978-1-4673-4976-5
  • Type

    conf

  • DOI
    10.1109/TAAI.2012.20
  • Filename
    6395002