DocumentCode :
3644675
Title :
Categories of Source Code in Industrial Systems
Author :
Tiago L. Alves
Author_Institution :
Software Improvement Group, Amsterdam, Netherlands
fYear :
2011
Firstpage :
335
Lastpage :
338
Abstract :
The categorization of source code artifacts affects how the overall product is measured and consequently how these measurements are interpreted. When measuring complexity, for instance, failing to distinguish test and generated code will affect complexity measurements possibly leading to an erroneous interpretation of the overall product complexity. Although categorization problems are known, there seems to be little attention given to this subject in the literature. In this paper, we introduce a categorization for source code artifacts and present an empirical study providing evidence of each category. Artifacts are divided into production and test code, and then these categories are sub-divided into manually-maintained, generated, library, and example code. By analyzing 80 Java and C# industrial systems, we have found evidence of the majority of categories. We show that in average production code only accounts for 60% of a product volume. Also, we have found that for some systems the overall percentage of test and generated code, each can account to over 70% and of library code to over 40%. Finally we discuss the difficulties of distinguishing source code artifacts and conclude with directions for further research.
Keywords :
"Production","Libraries","Manuals","Software","Software measurement","Java"
Publisher :
ieee
Conference_Titel :
Empirical Software Engineering and Measurement (ESEM), 2011 International Symposium on
ISSN :
1938-6451
Print_ISBN :
978-1-4577-2203-5
Type :
conf
DOI :
10.1109/ESEM.2011.42
Filename :
6092582
Link To Document :
بازگشت