Title :
Matching data fragments with imperfect identifiers from disparate sources
Author :
Craig, Michael B. ; Moody, Benjamin E. ; Jia, Sherman ; Villarroel, Mauricio C. ; Mark, Roger G.
Author_Institution :
Div. of Health Sci. & Technol., Harvard-MIT, Cambridge, MA, USA
Abstract :
The Multiparameter Intelligent Monitoring in Intensive Care (MIMIC-II) Database includes waveforms and derived parameters from bedside monitors, clinical data from an ICU information system, and data from other hospital laboratories and archives, for thousands of patients. These data come from devices under separate domains that often do not retain detailed information regarding relationships between parameters. We developed software for matching data fragments with incomplete and sometimes incorrect identifiers. We found that names, medical record numbers, waveform times and durations, and ICU admission and discharge records were most helpful when available; however, physiological data can also be used in some circumstances. Rule-based normalization and text edit-distance metrics are used in addition to a visual verification tool for patients whose records cannot be assembled automatically. Thus, a majority of the available waveform recordings are matched to patients in the clinical database.
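As an illustration only, the combination of rule-based normalization and a text edit-distance metric mentioned in the abstract might be sketched as follows. This is not the authors' software: the function names, the specific normalization rules (accent stripping, upper-casing, punctuation removal, token sorting), and the distance threshold of 2 are all assumptions chosen for the example.

```python
import re
import unicodedata


def normalize_name(raw: str) -> str:
    """Hypothetical rule-based normalization for a patient name.

    Assumed rules: strip accents, replace non-letters with spaces,
    upper-case, and sort tokens so "Last, First" and "First Last" agree.
    """
    text = unicodedata.normalize("NFKD", raw)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^A-Za-z ]+", " ", text).upper()
    return " ".join(sorted(text.split()))


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed row by row with dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def best_match(fragment_name, candidates, max_distance=2):
    """Return the closest candidate name, or None if none is close enough.

    max_distance is an assumed threshold; records returning None would be
    left for manual (for example, visual) verification.
    """
    target = normalize_name(fragment_name)
    scored = [(edit_distance(target, normalize_name(c)), c) for c in candidates]
    if not scored:
        return None
    distance, name = min(scored)
    return name if distance <= max_distance else None
```

In this sketch, a waveform fragment labeled "Smith, John" would be paired with a clinical record named "john smyth", while a fragment with no sufficiently close candidate is left unmatched rather than forced onto the nearest name.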
Keywords :
medical information systems; medical signal processing; patient monitoring; waveform analysis; ICU information system; bedside monitors; clinical database; discharge records; hospital laboratories; imperfect identifiers; intensive care database; matching data fragments; medical record; multiparameter intelligent monitoring; physiological data; rule-based normalization; text edit-distance metrics; visual verification tool; waveform recordings; Biomedical monitoring; Databases; Heart rate; Hospitals; Monitoring; Real time systems; Servers;
Conference_Title :
Computing in Cardiology, 2010
Conference_Location :
Belfast
Print_ISBN :
978-1-4244-7318-2