DocumentCode :
1999674
Title :
Identifying Word Relations in Software: A Comparative Study of Semantic Similarity Tools
Author :
Sridhara, Giriprasad ; Hill, Emily ; Pollock, Lori ; Vijay-Shanker, K.
Author_Institution :
Dept. of Comput. & Inf. Sci., Delaware Univ., Newark, DE
fYear :
2008
fDate :
10-13 June 2008
Firstpage :
123
Lastpage :
132
Abstract :
Modern software systems are typically large and complex, making comprehension of these systems extremely difficult. Experienced programmers comprehend code by seamlessly processing synonyms and other word relations. Thus, we believe that automated comprehension and software tools can be significantly improved by leveraging word relations in software. In this paper, we perform a comparative study of six state of the art, English-based semantic similarity techniques and evaluate their effectiveness on words from the comments and identifiers in software. Our results suggest that applying English-based semantic similarity techniques to software without any customization could be detrimental to the performance of the client software tools. We propose strategies to customize the existing semantic similarity techniques to software, and describe how various program comprehension tools can benefit from word relation information.
Keywords :
natural language processing; programming language semantics; reverse engineering; software tools; English-based semantic similarity; automated code comprehension; program comprehension; semantic similarity tool; software system; software tool; word relation; Databases; Natural languages; Performance evaluation; Programming profession; Software maintenance; Software performance; Software quality; Software systems; Software tools; Vehicles; automated natural language analysis; comparative study; semantic similarity techniques; tools; word relations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Program Comprehension, 2008. ICPC 2008. The 16th IEEE International Conference on
Conference_Location :
Amsterdam
ISSN :
1092-8138
Print_ISBN :
978-0-7695-3176-2
Type :
conf
DOI :
10.1109/ICPC.2008.18
Filename :
4556124
Link To Document :
بازگشت