DocumentCode :
3175163
Title :
De-identification algorithm for free-text nursing notes
Author :
Douglass, M.M. ; Cliffford, G.D. ; Reisner, A. ; Long, W.J. ; Moody, G.B. ; Mark, R.G.
Author_Institution :
Div. of Health Sci. & Technol., Harvard-MIT, Cambridge, MA
fYear :
2005
fDate :
25-28 Sept. 2005
Firstpage :
331
Lastpage :
334
Abstract :
All personally identifiable information must be removed from patient medical records before the data can be shared with other researchers. We present an automated method of removing protected health information (PHI) from free-text nursing notes taken from a U.S. hospital. We have previously shown that one clinician can locate PHI in nursing notes with an average sensitivity of 0.81, and for teams of two clinicians the sensitivity is 0.94. Our method uses lexical look-up tables, regular expressions, and simple heuristics to locate PHI with an overall sensitivity of 0.92 (0.98 for names, 0.96 for dates), which is significantly better than the average sensitivity of a single human. The algorithm has a positive predictive value of only 0.44, so additional software was developed to allow the user to review the terms identified as PHI and manually eliminate false positives. The algorithm is open-source and will be made freely available on PhysioNet together with a re-identified corpus of nursing notes
Keywords :
health care; heuristic programming; medical information systems; public domain software; table lookup; PhysioNet; de-identification algorithm; free-text nursing notes; heuristics; look-up table; open-source algorithm; patient medical record; personally identifiable information; protected health information; regular expressions; Computer science; Guidelines; Hospitals; Humans; Medical services; Open source software; Protection; Roentgenium; Software algorithms; Terminology;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computers in Cardiology, 2005
Conference_Location :
Lyon
Print_ISBN :
0-7803-9337-6
Type :
conf
DOI :
10.1109/CIC.2005.1588104
Filename :
1588104
Link To Document :
بازگشت