DocumentCode
2727305
Title
Duplication Detection for Software Bug Reports Based on BM25 Term Weighting
Author
Cheng-Zen Yang ; Hung-Hsueh Du ; Sin-Sian Wu ; Ing-Xiang Chen
Author_Institution
Dept. of Comput. Sci. & Eng., Yuan Ze Univ., Chungli, Taiwan
fYear
2012
fDate
16-18 Nov. 2012
Firstpage
33
Lastpage
38
Abstract
Handling bug reports is an important issue in software maintenance. Recently, detection on duplicate bug reports has received much attention. There are two main reasons. First, duplicate bug reports may waste human resource to process these redundant reports. Second, duplicate bug reports may provide abundant information for further software maintenance. In the past studies, many schemes have been proposed using the information retrieval and natural language processing techniques. In this thesis, we propose a novel detection scheme based on a BM25 term weighting scheme. We have conducted empirical experiments on three open source projects, Apache, ArgoUML, and SVN. The experimental results show that the BM25-based scheme can effectively improve the detection performance in nearly all cases.
Keywords
Unified Modeling Language; program debugging; program testing; public domain software; software maintenance; software quality; Apache; ArgoUML; BM25 term weighting scheme; SVN; detection performance; duplicate bug report detection; duplication detection; human resource wastage; information retrieval; natural language processing techniques; open source projects; software bug report handling; software maintenance; Atmospheric measurements; Information retrieval; Natural language processing; Particle measurements; Software; Unified modeling language; Vectors; BM25; bug reports; duplication detection; term weighting;
fLanguage
English
Publisher
ieee
Conference_Titel
Technologies and Applications of Artificial Intelligence (TAAI), 2012 Conference on
Conference_Location
Tainan
Print_ISBN
978-1-4673-4976-5
Type
conf
DOI
10.1109/TAAI.2012.20
Filename
6395002
Link To Document