• DocumentCode
    3237127
  • Title

    From Whence It Came: Detecting Source Code Clones by Analyzing Assembler

  • Author

    Davis, Ian J. ; Godfrey, Michael W.

  • Author_Institution
    David R. Cheriton Sch. of Comput. Sci., Univ. of Waterloo, Waterloo, ON, Canada
  • fYear
    2010
  • fDate
    13-16 Oct. 2010
  • Firstpage
    242
  • Lastpage
    246
  • Abstract
    To date, most clone detection techniques have concentrated on various forms of source code analysis, often by analyzing token streams. In this paper, we introduce a complementary technique of analyzing generated assembler for clones. This approach is appealing as it is mostly impervious to trivial changes in the source, with compilation serving as a kind of normalization technique. We have built detectors to analyze both Java VM code as well as GCC Linux assembler for C and C++. In the paper, we describe our approach and show how it can serve as a valuable complementary semantic approach to syntactic source code based detection.
  • Keywords
    C++ language; Java; Linux; program assemblers; program compilers; source coding; C; C++; GCC Linux assembler; Java VM code; assembler analysis; clone detection technique; normalization technique; source code analysis; syntactic source code based detection; Arrays; Cloning; Image edge detection; Java; Reverse engineering; Software; Syntactics; Clone Detection; Reverse Engineering;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Reverse Engineering (WCRE), 2010 17th Working Conference on
  • Conference_Location
    Beverly, MA
  • ISSN
    1095-1350
  • Print_ISBN
    978-1-4244-8911-4
  • Type

    conf

  • DOI
    10.1109/WCRE.2010.35
  • Filename
    5645565