• DocumentCode
    2675081
  • Title

    Physical and conceptual identifier dispersion: Measures and relation to fault proneness

  • Author

    Arnaoudova, Venera ; Eshkevari, Laleh ; Oliveto, Rocco ; Guéhéneuc, Yann-Gaël ; Antoniol, Giuliano

  • Author_Institution
    SOCCER Lab., Ecole Polytech. de Montreal, Montréal, QC, Canada
  • fYear
    2010
  • fDate
    12-18 Sept. 2010
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Poorly-chosen identifiers have been reported in the literature as misleading and increasing the program comprehension effort. Identifiers are composed of terms, which can be dictionary words, acronyms, contractions, or simple strings. We conjecture that the use of identical terms in different contexts may increase the risk of faults. We investigate our conjecture using a measure combining term entropy and term context coverage to study whether certain terms increase the odds ratios of methods to be fault-prone. Entropy measures the physical dispersion of terms in a program: the higher the entropy, the more scattered across the program the terms. Context coverage measures the conceptual dispersion of terms: the higher their context coverage, the more unrelated the methods using them. We compute term entropy and context coverage of terms extracted from identifiers in Rhino 1.4R3 and ArgoUML 0.16. We show statistically that methods containing terms with high entropy and context coverage are more fault-prone than others.
  • Keywords
    Unified Modeling Language; entropy; software fault tolerance; software metrics; statistical analysis; ArgoUML 0.16; Rhino 1.4R3; conceptual identifier dispersion; fault proneness; physical identifier dispersion; program comprehension; term context coverage; term entropy; Conferences; Context; Correlation; Entropy; Measurement; Presses; Software engineering; Entropy; Fault Models; Information Retrieval; Program Comprehension; Source Code Identifiers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance (ICSM), 2010 IEEE International Conference on
  • Conference_Location
    Timisoara
  • ISSN
    1063-6773
  • Print_ISBN
    978-1-4244-8630-4
  • Electronic_ISBN
    1063-6773
  • Type

    conf

  • DOI
    10.1109/ICSM.2010.5609748
  • Filename
    5609748