• DocumentCode
    2524205
  • Title

    Code Similarities Beyond Copy & Paste

  • Author

    Juergens, Elmar ; Deissenboeck, Florian ; Hummel, Benjamin

  • Author_Institution
    Inst. fur Inf., Tech. Univ. Munchen, Garching, Germany
  • fYear
    2010
  • fDate
    15-18 March 2010
  • Firstpage
    78
  • Lastpage
    87
  • Abstract
    Redundant source code hinders software maintenance, since updates have to be performed in multiple places. This holds independent of whether redundancy was created by copy&paste or by independent development of behaviorally similar code. Existing clone detection tools successfully discover syntactically similar redundant code. They thus work well for redundancy that has been created by copy&paste. But: how syntactically similar is behaviorally similar code of independent origin? This paper presents the results of a controlled experiment that demonstrates that behaviorally similar code of independent origin is highly unlikely to be syntactically similar. In fact, it is so syntactically different, that existing clone detection approaches cannot identify more than 1% of such redundancy. This is unfortunate, as manual inspections of open source software indicate that behaviorally similar code of independent origin does exist in practice and does present problems to maintenance.
  • Keywords
    public domain software; software maintenance; code similarities; copy&paste; open source software; redundant source code; software maintenance; Cloning; Detectors; Inspection; Java; Manuals; Redundancy; Software maintenance; code similarity; open source;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance and Reengineering (CSMR), 2010 14th European Conference on
  • Conference_Location
    Madrid
  • ISSN
    1534-5351
  • Print_ISBN
    978-1-61284-369-8
  • Electronic_ISBN
    1534-5351
  • Type

    conf

  • DOI
    10.1109/CSMR.2010.33
  • Filename
    5714422