Title :
Improving disease prediction using ICD-9 ontological features
Author :
Popescu, Mihail ; Khalilia, Mohammad
Author_Institution :
Health Manage. & Inf. Dept., Univ. of Missouri, Columbia, MO, USA
Abstract :
Disease prediction has become important in a variety of applications such as health insurance, tailored health communication and public health. Disease prediction is usually performed using publically available datasets such as HCUP, NHANES or MDS that were initially designed for health reporting or health cost evaluation but not for disease prediction. In these datasets, medical diagnoses are traditionally arranged in "diagnose-related groups" (DRGs). In this paper we compare the disease prediction based on crisp DRG features with the results obtained employing a new set of features that consist of the fuzzy membership of patient diagnoses in the DRG groups. The fuzzy membership features were computed using an ICD-9 ontological similarity approach. The prediction results obtained on a subset of 9,000 patients from the 2005 HCUP data representing three diseases (diabetes, atherosclerosis and hypertension) using two classifiers (random forest and SVM trained on 21,000 samples) show significant (about 10%) improvement as measured by the area under the ROC curve (AROC).
Keywords :
diseases; fuzzy set theory; health care; medical computing; ontologies (artificial intelligence); support vector machines; HCUP database; ICD-9 ontological features; MDS database; NHANES database; ROC curve; SVM; atherosclerosis; diabetes; diagnose-related groups; disease prediction; fuzzy membership features; health cost evaluation; health insurance; health reporting evaluation; hypertension; medical diagnoses; patient diagnoses; public health; random forest; tailored health communication; Diabetes; Diseases; Feature extraction; Hypertension; Medical diagnostic imaging; Radio frequency; Support vector machines; ICD-9 similarity measure; SVM; disease prediction; ontological features; random forest;
Conference_Titel :
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-7315-1
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZY.2011.6007410