• DocumentCode
    1928712
  • Title

    Linear complexity object-oriented similarity for clone detection and software evolution analyses

  • Author

    Merlo, E. ; Antoniol, G. ; Di Penta, M. ; Rollo, V.F.

  • Author_Institution
    Ecole Polytechnique de Montreal, Que., Canada
  • fYear
    2004
  • fDate
    11-14 Sept. 2004
  • Firstpage
    412
  • Lastpage
    416
  • Abstract
    With the widespread adoption of object-oriented technologies, the lack of computationally efficient and scalable approaches is limiting the ability to model and analyze the history of large object-oriented software systems. This paper proposes an approximate representation of object-oriented code characteristics, inspired by the centroids used for clustering in pattern recognition. One application of this representation is a linear-time algorithm for detecting duplicated or nearly duplicated code in object-oriented systems. The algorithm's accuracy and time complexity were assessed on 11 releases of a large software system, the Eclipse framework.
  • Keywords
    computational complexity; data flow analysis; object-oriented programming; pattern recognition; software maintenance; software prototyping; Eclipse framework; clone detection; large object-oriented software system; linear complexity object-oriented similarity; linear-time complexity algorithm; object-oriented code characteristics; object-oriented software evolution; pattern recognition centroids; software evolution analysis; source code analysis; time complexity; Application software; Cloning; Clustering algorithms; History; Object oriented modeling; Open source software; Pattern recognition; Software algorithms; Software maintenance; Software systems
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Software Maintenance, 2004. Proceedings. 20th IEEE International Conference on
  • ISSN
    1063-6773
  • Print_ISBN
    0-7695-2213-0
  • Type

    conf

  • DOI
    10.1109/ICSM.2004.1357826
  • Filename
    1357826