DocumentCode
2139775
Title
Recovering a Balanced Overview of Topics in a Software Domain
Author
Kelly, Matthew B. ; Alexander, Jason S. ; Adams, Bram ; Hassan, Ahmed E.
Author_Institution
Sch. of Comput., Queen´´s Univ., Kingston, ON, Canada
fYear
2011
fDate
25-26 Sept. 2011
Firstpage
135
Lastpage
144
Abstract
Domain analysis is a crucial step in the development of product lines and software reuse in general, in which domain experts try to identify the commonalities and variability between different products of a particular domain. This identification is challenging, since it requires significant manual analysis of requirements, design documents, and source code. In order to support domain analysts, this paper proposes to use topic modeling techniques to automatically identify common and unique concepts (topics) from the source code of different software products in a domain. An empirical case study of 19 projects, spread across the domains of web browsers and operating systems (totaling over 39 MLOC), shows that our approach is able to identify commonalities and variabilities at different levels of granularity (sub-domain and domain). In addition, we show how the commonalities are evenly spread across all projects of the domain.
Keywords
data analysis; software reusability; Web browser; domain analysis; operating systems; product line development; software domain; software product; software reuse; topic balanced overview recovery; topic modeling technique; Analytical models; Browsers; Computer architecture; Operating systems; Software systems; Unified modeling language; domain analysis; empirical study; topic modeling;
fLanguage
English
Publisher
ieee
Conference_Titel
Source Code Analysis and Manipulation (SCAM), 2011 11th IEEE International Working Conference on
Conference_Location
Williamsburg, VI
Print_ISBN
978-1-4577-0932-6
Type
conf
DOI
10.1109/SCAM.2011.23
Filename
6065172
Link To Document