Title :
Evaluating the specificity of text retrieval queries to support software engineering tasks
Author :
Haiduc, Sonia ; Bavota, Gabriele ; Oliveto, Rocco ; Marcus, Andrian ; De Lucia, Andrea
Author_Institution :
Comput. Sci. Dept., Wayne State Univ., Detroit, MI, USA
Abstract :
Text retrieval approaches have been used to address many software engineering tasks. In most cases, their use involves issuing a textual query to retrieve a set of relevant software artifacts from the system. The performance of all these approaches depends on the quality of the given query (i.e., its ability to describe the information need in such a way that the relevant software artifacts are retrieved during the search). Currently, the only way to tell that a query failed to lead to the expected software artifacts is by investing time and effort in analyzing the search results. In addition, it is often very difficult to ascertain what part of the query leads to poor results. We propose a novel pre-retrieval metric, which reflects the quality of a query by measuring the specificity of its terms. We exemplify the use of the new specificity metric on the task of concept location in source code. A preliminary empirical study shows that our metric is a good effort predictor for text retrieval-based concept location, outperforming existing techniques from the field of natural language document retrieval.
Keywords :
natural languages; query processing; software metrics; text analysis; natural language text; preretrieval metric; query quality; software artifacts; software engineering tasks; source code; specificity evaluation; specificity metric; text retrieval queries; text retrieval-based concept location; Context; Correlation; Entropy; Information retrieval; Measurement; Natural languages; Software; Concept location; Query specificity; Text retrieval;
Conference_Titel :
Software Engineering (ICSE), 2012 34th International Conference on
Conference_Location :
Zurich
Print_ISBN :
978-1-4673-1066-6
Electronic_ISBN :
0270-5257
DOI :
10.1109/ICSE.2012.6227101