DocumentCode :
2109889
Title :
Predicting atrial fibrillation and flutter using Electronic Health Records
Author :
Karnik, S. ; Sin Lam Tan ; Berg, B. ; Glurich, I. ; Jinfeng Zhang ; Vidaillet, H.J. ; Page, C.D. ; Chowdhary, R.
Author_Institution :
Biomed. Inf. Res. Center, Marshfield Clinic Res. Found., Marshfield, WI, USA
fYear :
2012
fDate :
Aug. 28 2012-Sept. 1 2012
Firstpage :
5562
Lastpage :
5565
Abstract :
Electronic Health Records (EHR) contain large amounts of useful information that could potentially be used for building models for predicting onset of diseases. In this study, we have investigated the use of free-text and coded data in Marshfield Clinic´s EHR, individually and in combination for building machine learning based models to predict the first ever episode of atrial fibrillation and/or atrial flutter (AFF). We trained and evaluated our AFF models on the EHR data across different time intervals (1, 3, 5 and all years) prior to first documented onset of AFF. We applied several machine learning methods, including naïve bayes, support vector machines (SVM), logistic regression and random forests for building AFF prediction models and evaluated these using 10-fold cross-validation approach. On text-based datasets, the best model achieved an F-measure of 60.1%, when applied exclusively to coded data. The combination of textual and coded data achieved comparable performance. The study results attest to the relative merit of utilizing textual data to complement the use of coded data for disease onset prediction modeling.
Keywords :
Bayes methods; diseases; learning (artificial intelligence); medical information systems; support vector machines; text analysis; AFF prediction models; EHR data; Marshfield clinic; SVM; atrial fibrillation; atrial flutter; coded data; disease onset prediction modeling; electronic health records; free-text; logistic regression; machine learning based models; naïve Bayes; random forests; support vector machines; text-based datasets; useful information; Atrial fibrillation; Data models; Diseases; Medical diagnostic imaging; Predictive models; Support vector machines; Unified modeling language; Atrial Fibrillation; Atrial Flutter; Electronic Health Records; Humans;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society (EMBC), 2012 Annual International Conference of the IEEE
Conference_Location :
San Diego, CA
ISSN :
1557-170X
Print_ISBN :
978-1-4244-4119-8
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/EMBC.2012.6347254
Filename :
6347254
Link To Document :
بازگشت