DocumentCode :
3407190
Title :
Tag recommendation in software information sites
Author :
Xin Xia ; Lo, Daniel ; Xinyu Wang ; Bo Zhou
fYear :
2013
fDate :
18-19 May 2013
Firstpage :
287
Lastpage :
296
Abstract :
Nowadays, software engineers use a variety of online media to search and become informed of new and interesting technologies, and to learn from and help one another. We refer to these kinds of online media which help software engineers improve their performance in software development, maintenance and test processes as software information sites. It is common to see tags in software information sites and many sites allow users to tag various objects with their own words. Users increasingly use tags to describe the most important features of their posted contents or projects. In this paper, we propose TagCombine, an automatic tag recommendation method which analyzes objects in software information sites. TagCombine has 3 different components: 1. multilabel ranking component which considers tag recommendation as a multi-label learning problem; 2. similarity based ranking component which recommends tags from similar objects; 3. tag-term based ranking component which considers the relationship between different terms and tags, and recommends tags after analyzing the terms in the objects. We evaluate TagCombine on 2 software information sites, StackOverflow and Freecode, which contain 47,668 and 39,231 text documents, respectively, and 437 and 243 tags, respectively. Experiment results show that for StackOverflow, our TagCombine achieves recall@5 and recall@10 scores of 0.5964 and 0.7239, respectively; For Freecode, it achieves recall@5 and recall@10 scores of 0.6391 and 0.7773, respectively. Moreover, averaging over StackOverflow and Freecode results, we improve TagRec proposed by Al-Kofahi et al. by 22.65% and 14.95%, and the tag recommendation method proposed by Zangerle et al. by 18.5% and 7.35% for recall@5 and recall@10 scores.
Keywords :
learning (artificial intelligence); program testing; recommender systems; social networking (online); software maintenance; text analysis; Freecode; StackOverflow; TagCombine; TagRec; automatic tag recommendation method; multilabel learning problem; multilabel ranking component; online media; similarity based ranking component; software development; software engineers; software information sites; software maintenance; software test processes; tag-term based ranking component; text documents; Educational institutions; Media; Prediction algorithms; Search problems; Software; Software algorithms; Vectors; Online Media; Software Information Sites; Tag Recommendation; TagCombine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mining Software Repositories (MSR), 2013 10th IEEE Working Conference on
Conference_Location :
San Francisco, CA
ISSN :
2160-1852
Print_ISBN :
978-1-4799-0345-0
Type :
conf
DOI :
10.1109/MSR.2013.6624040
Filename :
6624040
Link To Document :
بازگشت