Title :
A machine learning approach for tracing regulatory codes to product specific requirements
Author :
Cleland-Huang, Jane ; Czauderna, Adam ; Gibiec, Marek ; Emenecker, John
Author_Institution :
Syst. & Requirements Eng. Center (SAREC), DePaul Univ., Chicago, IL, USA
Abstract :
Regulatory standards, designed to protect the safety, security, and privacy of the public, govern numerous areas of software intensive systems. Project personnel must therefore demonstrate that an as-built system meets all relevant regulatory codes. Current methods for demonstrating compliance rely either on after-the-fact audits, which can lead to significant refactoring when regulations are not met, or else require analysts to construct and use traceability matrices to demonstrate compliance. Manual tracing can be prohibitively time-consuming; however automated trace retrieval methods are not very effective due to the vocabulary mismatches that often occur between regulatory codes and product level requirements. This paper introduces and evaluates two machine-learning methods, designed to improve the quality of traces generated between regulatory codes and product level requirements. The first approach uses manually created traceability matrices to train a trace classifier, while the second approach uses web-mining techniques to reconstruct the original trace query. The techniques were evaluated against security regulations from the USA government´s Health Insurance Privacy and Portability Act (HIPAA) traced against ten healthcare related requirements specifications. Results demonstrated improvements for the subset of HIPAA regulations that exhibited high fan-out behavior across the requirements datasets.
Keywords :
health care; learning (artificial intelligence); program diagnostics; security of data; software maintenance; HIPAA; Health Insurance Privacy and Portability Act; healthcare; machine learning approach; product specific requirements; regulatory codes; security regulations; software intensive systems; software refactoring; traceability matrices; web-mining techniques; Encryption; Measurement; Medical services; Probabilistic logic; Software; Training; regulatory compliance; requirements classification; traceability;
Conference_Titel :
Software Engineering, 2010 ACM/IEEE 32nd International Conference on
Conference_Location :
Cape Town
Print_ISBN :
978-1-60558-719-6
DOI :
10.1145/1806799.1806825