DocumentCode
3724425
Title
Mining Temporal Information to Improve Duplication Detection on Bug Reports
Author
Chao-Yuan Lee;Dan-Dan Hu;Zhong-Yi Feng;Cheng-Zen Yang
Author_Institution
Dept. of Comput. Sci. &
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
551
Lastpage
555
Abstract
Due to the vast amount of bug reports in many large-scale open source projects, detecting duplicate reports has received much attention in recent years. Textual information and different kinds of categorical information have been leveraged in many detection methods. Although past studies have shown the influence of the temporal information in the submission interval of duplicate bug reports, only few methods consider the temporal information in its plain interval form. In this paper, we propose a temporal model to improve the detection performance by considering the submission time and the version information. We discuss the effectiveness by implementing it based on a prestigious BM25Fext-based method. Empirical experiments have been conducted on a dataset collected from the Eclipse open source project. Compared with the baseline BM25Fext-based method, the proposed scheme can obtain performance improvement consistently.
Keywords
"Testing","Feature extraction","Natural language processing","Software","Data mining","Frequency control","Informatics"
Publisher
ieee
Conference_Titel
Advanced Applied Informatics (IIAI-AAI), 2015 IIAI 4th International Congress on
Print_ISBN
978-1-4799-9957-6
Type
conf
DOI
10.1109/IIAI-AAI.2015.180
Filename
7373969
Link To Document