Title :
Comparing hierarchical dirichlet process with latent dirichlet allocation in bug report multiclass classification
Author :
Limsettho, Nachai ; Hata, Hiroki ; Matsumoto, Ken-ichi
Author_Institution :
Grad. Sch. of Inf. Sci., Nara Inst. of Sci. & Technol., Nara, Japan
fDate :
June 30 2014-July 2 2014
Abstract :
Bug reports play essential roles in many software engineering tasks. Since validity and performance of these tasks definitely rely on the quality of bug reports, accurate information from bug reports is very important. However, as found in previous study, significant numbers of reports classified as bug are not really a bug. Recent studies proposed techniques to automatically classify bug reports into binary classes, yet there is still more to desire. These bug reports can be classified into multiple classes, which could help to identify what these reports are actually about. Moreover, previous study only looks into one possibility of topic modeling, that is, Latent Dirichlet Allocation (LDA). While LDA has its advantage, parameter tuning is required. In this paper, we propose a nonparametric approach to automatically classify bug reports with, another topic modeling method, Hierarchical Dirichlet Process (HDP). The result indicates that our nonparametric approach performance is comparable to the parametric one. We also examine various aspects of LDA to provide more thoroughly understanding of this process.
Keywords :
pattern classification; program debugging; software maintenance; HDP; LDA; bug report multiclass classification; hierarchical Dirichlet process; latent Dirichlet allocation; nonparametric approach; topic modeling method; Accuracy; Data mining; Data models; Logistics; Niobium; Resource management; Tuning; Hierarchical Dirichlet Process; Latent Dirchlet Allocation; bug classification; bug reports; multiclass classification; topic modeling;
Conference_Titel :
Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2014 15th IEEE/ACIS International Conference on
Conference_Location :
Las Vegas, NV
DOI :
10.1109/SNPD.2014.6888695