DocumentCode :
2745582
Title :
Detection of Duplicate Defect Reports Using Natural Language Processing
Author :
Runeson, Per ; Alexandersson, Magnus ; Nyholm, Oskar
Author_Institution :
Software Eng. Res. Group, Lund Univ., Lund
fYear :
2007
fDate :
20-26 May 2007
Firstpage :
499
Lastpage :
510
Abstract :
Defect reports are generated from various testing and development activities in software engineering. Sometimes two reports are submitted that describe the same problem, leading to duplicate reports. These reports are mostly written in structured natural language, and as such, it is hard to compare two reports for similarity with formal methods. In order to identify duplicates, we investigate using natural language processing (NLP) techniques to support the identification. A prototype tool is developed and evaluated in a case study analyzing defect reports at Sony Ericsson Mobile Communications. The evaluation shows that about 2/3 of the duplicates can possibly be found using the NLP techniques. Different variants of the techniques provide only minor result differences, indicating a robust technology. User testing shows that the overall attitude towards the technique is positive and that it has a growth potential.
Keywords :
natural language processing; program testing; software prototyping; Sony Ericsson Mobile Communication; duplicate defect report detection; formal method; natural language processing; prototype tool; software engineering; software testing; user testing; Failure analysis; Mobile communication; Natural language processing; Natural languages; Prototypes; Relays; Robustness; Software engineering; Software testing; Vocabulary;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Software Engineering, 2007. ICSE 2007. 29th International Conference on
Conference_Location :
Minneapolis, MN
ISSN :
0270-5257
Print_ISBN :
0-7695-2828-7
Type :
conf
DOI :
10.1109/ICSE.2007.32
Filename :
4222611