DocumentCode
3118412
Title
Improving disease prediction using ICD-9 ontological features
Author
Popescu, Mihail ; Khalilia, Mohammad
Author_Institution
Health Manage. & Inf. Dept., Univ. of Missouri, Columbia, MO, USA
fYear
2011
fDate
27-30 June 2011
Firstpage
1805
Lastpage
1809
Abstract
Disease prediction has become important in a variety of applications such as health insurance, tailored health communication and public health. Disease prediction is usually performed using publically available datasets such as HCUP, NHANES or MDS that were initially designed for health reporting or health cost evaluation but not for disease prediction. In these datasets, medical diagnoses are traditionally arranged in "diagnose-related groups" (DRGs). In this paper we compare the disease prediction based on crisp DRG features with the results obtained employing a new set of features that consist of the fuzzy membership of patient diagnoses in the DRG groups. The fuzzy membership features were computed using an ICD-9 ontological similarity approach. The prediction results obtained on a subset of 9,000 patients from the 2005 HCUP data representing three diseases (diabetes, atherosclerosis and hypertension) using two classifiers (random forest and SVM trained on 21,000 samples) show significant (about 10%) improvement as measured by the area under the ROC curve (AROC).
Keywords
diseases; fuzzy set theory; health care; medical computing; ontologies (artificial intelligence); support vector machines; HCUP database; ICD-9 ontological features; MDS database; NHANES database; ROC curve; SVM; atherosclerosis; diabetes; diagnose-related groups; disease prediction; fuzzy membership features; health cost evaluation; health insurance; health reporting evaluation; hypertension; medical diagnoses; patient diagnoses; public health; random forest; tailored health communication; Diabetes; Diseases; Feature extraction; Hypertension; Medical diagnostic imaging; Radio frequency; Support vector machines; ICD-9 similarity measure; SVM; disease prediction; ontological features; random forest;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
Conference_Location
Taipei
ISSN
1098-7584
Print_ISBN
978-1-4244-7315-1
Electronic_ISBN
1098-7584
Type
conf
DOI
10.1109/FUZZY.2011.6007410
Filename
6007410
Link To Document