DocumentCode
2155398
Title
On the role of the nouns in IR-based traceability recovery
Author
Capobianco, Giovanni ; De Lucia, Andrea ; Oliveto, Rocco ; Panichella, Annibale ; Panichella, Sebastiano
Author_Institution
STAT, Univ. of Molise, Pesche
fYear
2009
fDate
17-19 May 2009
Firstpage
148
Lastpage
157
Abstract
The intensive human effort needed to manually manage traceability information has increased the interest in utilising semi-automated traceability recovery techniques. This paper presents a simple way to improve the accuracy of traceability recovery methods based on information retrieval techniques. The proposed method acts on the artefact indexing considering only the nouns contained in the artefact content to define the semantics of an artefact. The rationale behind such a choice is that the language used in software documents can be classified as a sectorial language, where the terms that provide more indication on the semantics of a document are the nouns. The results of a reported case study demonstrate that the proposed artefact indexing significantly improves the accuracy of traceability recovery methods based on the probabilistic or vector space based IR models.
Keywords
indexing; information retrieval; software engineering; IR-based traceability recovery; artefact indexing; information retrieval techniques; software documents; software engineering community; vector space; Computer errors; Engineering management; Humans; Indexing; Information analysis; Information management; Information retrieval; Natural languages; Software engineering; Software maintenance;
fLanguage
English
Publisher
ieee
Conference_Titel
Program Comprehension, 2009. ICPC '09. IEEE 17th International Conference on
Conference_Location
Vancouver, BC
ISSN
1092-8138
Print_ISBN
978-1-4244-3998-0
Electronic_ISBN
1092-8138
Type
conf
DOI
10.1109/ICPC.2009.5090038
Filename
5090038
Link To Document