Title :
On the role of the nouns in IR-based traceability recovery
Author :
Capobianco, Giovanni ; De Lucia, Andrea ; Oliveto, Rocco ; Panichella, Annibale ; Panichella, Sebastiano
Author_Institution :
STAT, Univ. of Molise, Pesche
Abstract :
The intensive human effort needed to manually manage traceability information has increased the interest in utilising semi-automated traceability recovery techniques. This paper presents a simple way to improve the accuracy of traceability recovery methods based on information retrieval techniques. The proposed method acts on the artefact indexing considering only the nouns contained in the artefact content to define the semantics of an artefact. The rationale behind such a choice is that the language used in software documents can be classified as a sectorial language, where the terms that provide more indication on the semantics of a document are the nouns. The results of a reported case study demonstrate that the proposed artefact indexing significantly improves the accuracy of traceability recovery methods based on the probabilistic or vector space based IR models.
Keywords :
indexing; information retrieval; software engineering; IR-based traceability recovery; artefact indexing; information retrieval techniques; software documents; software engineering community; vector space; Computer errors; Engineering management; Humans; Indexing; Information analysis; Information management; Information retrieval; Natural languages; Software engineering; Software maintenance;
Conference_Titel :
Program Comprehension, 2009. ICPC '09. IEEE 17th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4244-3998-0
Electronic_ISBN :
1092-8138
DOI :
10.1109/ICPC.2009.5090038