• DocumentCode
    2074443
  • Title

    A machine learning approach for tracing regulatory codes to product specific requirements

  • Author

    Cleland-Huang, Jane ; Czauderna, Adam ; Gibiec, Marek ; Emenecker, John

  • Author_Institution
    Syst. & Requirements Eng. Center (SAREC), DePaul Univ., Chicago, IL, USA
  • Volume
    1
  • fYear
    2010
  • fDate
    2-8 May 2010
  • Firstpage
    155
  • Lastpage
    164
  • Abstract
    Regulatory standards, designed to protect the safety, security, and privacy of the public, govern numerous areas of software intensive systems. Project personnel must therefore demonstrate that an as-built system meets all relevant regulatory codes. Current methods for demonstrating compliance rely either on after-the-fact audits, which can lead to significant refactoring when regulations are not met, or else require analysts to construct and use traceability matrices to demonstrate compliance. Manual tracing can be prohibitively time-consuming; however automated trace retrieval methods are not very effective due to the vocabulary mismatches that often occur between regulatory codes and product level requirements. This paper introduces and evaluates two machine-learning methods, designed to improve the quality of traces generated between regulatory codes and product level requirements. The first approach uses manually created traceability matrices to train a trace classifier, while the second approach uses web-mining techniques to reconstruct the original trace query. The techniques were evaluated against security regulations from the USA government´s Health Insurance Privacy and Portability Act (HIPAA) traced against ten healthcare related requirements specifications. Results demonstrated improvements for the subset of HIPAA regulations that exhibited high fan-out behavior across the requirements datasets.
  • Keywords
    health care; learning (artificial intelligence); program diagnostics; security of data; software maintenance; HIPAA; Health Insurance Privacy and Portability Act; healthcare; machine learning approach; product specific requirements; regulatory codes; security regulations; software intensive systems; software refactoring; traceability matrices; web-mining techniques; Encryption; Measurement; Medical services; Probabilistic logic; Software; Training; regulatory compliance; requirements classification; traceability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, 2010 ACM/IEEE 32nd International Conference on
  • Conference_Location
    Cape Town
  • ISSN
    0270-5257
  • Print_ISBN
    978-1-60558-719-6
  • Type

    conf

  • DOI
    10.1145/1806799.1806825
  • Filename
    6062083