• DocumentCode
    327102
  • Title

    IDENTIFIED (Integrated Dictionary-based Extraction of Non-language-dependent Token Information for Forensic Identification, Examination, and Discrimination): a dictionary-based system for extracting source code metrics for software forensics

  • Author

    Gray, Andrew ; Sallis, Philip ; MacDonell, Stephen

  • Author_Institution
    Dept. of Inf. Sci., Otago Univ., Dunedin, New Zealand
  • fYear
    1998
  • fDate
    26-29 Jan 1998
  • Firstpage
    252
  • Lastpage
    259
  • Abstract
    The frequency and severity of computer-based attacks such as viruses and worms, logic bombs, trojan horses, computer fraud, and plagiarism of software code have all become of increasing concern to many of those involved with information systems. Part of the difficulty experienced in collecting evidence regarding the attack or theft in such situations has been the definition and collection of appropriate measurements to use in models of authorship, With this purpose in mind a system called IDENTIFIED is being developed to assist with the task of software forensics which is the use of software code authorship analysis for legal or official purposes. IDENTIFIED uses combinations of wildcards and special characters to define count-based metrics, allows for hierarchical metametric definitions, automates much of the file handling task, extracts metric values from source code, and assists with the analysis and modelling processes. It is hoped that the availability of such tools will encourage more detailed research into this area of ever-increasing importance
  • Keywords
    computer crime; computer viruses; fraud; legislation; software metrics; IDENTIFIED; analysis processes; computer fraud; computer-based attacks; count-based metrics; dictionary-based system; file handling task automation; forensic discrimination; forensic examination; forensic identification; hierarchical metametric definitions; information systems; integrated dictionary-based non-language-dependent token information extraction; legal purposes; logic bombs; metric value extraction; modelling processes; official purposes; plagiarism; software code authorship analysis; software forensics; source code metrics extraction; trojan horses; viruses; wildcards; worms; Computer crime; Computer viruses; Computer worms; Data mining; Frequency; Information systems; Invasive software; Logic; Plagiarism; Weapons;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering: Education & Practice, 1998. Proceedings. 1998 International Conference
  • Conference_Location
    Dunedin
  • Print_ISBN
    0-8186-8828-9
  • Type

    conf

  • DOI
    10.1109/SEEP.1998.707658
  • Filename
    707658