DocumentCode :
3178938
Title :
On the Equivalence of Information Retrieval Methods for Automated Traceability Link Recovery
Author :
Oliveto, Rocco ; Gethers, Malcom ; Poshyvanyk, Denys ; De Lucia, Andrea
Author_Institution :
Dept. of Math. & Inf., Univ. of Salerno, Fisciano, Italy
fYear :
2010
fDate :
June 30 2010-July 2 2010
Firstpage :
68
Lastpage :
71
Abstract :
We present an empirical study to statistically analyze the equivalence of several traceability recovery methods based on Information Retrieval (IR) techniques. The analysis is based on Principal Component Analysis and on the analysis of the overlap of the set of candidate links provided by each method. The studied techniques are the Jensen-Shannon (JS) method, Vector Space Model (VSM), Latent Semantic Indexing (LSI), and Latent Dirichlet Allocation (LDA). The results show that while JS, VSM, and LSI are almost equivalent, LDA is able to capture a dimension unique to the set of techniques which we considered.
Keywords :
information retrieval; principal component analysis; Jensen-Shannon method; automated traceability link recovery; information retrieval methods; latent Dirichlet allocation; principal component analysis; statistical analysis; vector space model; Computer science; Documentation; Indexing; Informatics; Information retrieval; Large scale integration; Linear discriminant analysis; Mathematics; Principal component analysis; Software maintenance; Empirical Studies; Information Retrieval; Traceability Recovery;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Program Comprehension (ICPC), 2010 IEEE 18th International Conference on
Conference_Location :
Braga, Minho
ISSN :
1092-8138
Print_ISBN :
978-1-4244-7604-6
Electronic_ISBN :
1092-8138
Type :
conf
DOI :
10.1109/ICPC.2010.20
Filename :
5521762
Link To Document :
بازگشت