Title :
Combining Probabilistic Ranking and Latent Semantic Indexing for Feature Identification
Author :
Poshyvanyk, Denys ; Guéhéneuc, Yann-Gaël ; Marcus, Andrian ; Antoniol, Giuliano ; Rajlich, Václav
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI
Abstract :
The paper recasts the problem of feature location in source code as a decision-making problem in the presence of uncertainty. The main contribution is the combination of two existing techniques for feature location in source code. Both techniques provide a set of ranked facts from the software as a result for the feature identification problem. One technique is based on a scenario-based probabilistic ranking of events observed while executing a program under given scenarios. The other technique is defined as an information retrieval task, based on the latent semantic indexing of the source code. We show the viability and effectiveness of the combined technique with two case studies. The first case study is a replication of feature identification in Mozilla, which allows us to directly compare the results with previously published data. The second case study is a bug location problem in Mozilla. The results show that the combined technique improves feature identification significantly with respect to each technique used independently.
Keywords :
indexing; information retrieval; probability; program debugging; program diagnostics; software prototyping; Mozilla; bug location; decision-making; feature identification; feature location; information retrieval task; latent semantic indexing; probabilistic ranking; source code; Computer science; Data mining; Decision making; Indexing; Information analysis; Information retrieval; Software debugging; Software engineering; Software systems; Uncertainty;
Conference_Title :
14th IEEE International Conference on Program Comprehension (ICPC 2006)
Conference_Location :
Athens
Print_ISBN :
0-7695-2601-2
DOI :
10.1109/ICPC.2006.17