• DocumentCode
    3118412
  • Title

    Improving disease prediction using ICD-9 ontological features

  • Author

    Popescu, Mihail ; Khalilia, Mohammad

  • Author_Institution
    Health Manage. & Inf. Dept., Univ. of Missouri, Columbia, MO, USA
  • fYear
    2011
  • fDate
    27-30 June 2011
  • Firstpage
    1805
  • Lastpage
    1809
  • Abstract
    Disease prediction has become important in a variety of applications such as health insurance, tailored health communication and public health. Disease prediction is usually performed using publically available datasets such as HCUP, NHANES or MDS that were initially designed for health reporting or health cost evaluation but not for disease prediction. In these datasets, medical diagnoses are traditionally arranged in "diagnose-related groups" (DRGs). In this paper we compare the disease prediction based on crisp DRG features with the results obtained employing a new set of features that consist of the fuzzy membership of patient diagnoses in the DRG groups. The fuzzy membership features were computed using an ICD-9 ontological similarity approach. The prediction results obtained on a subset of 9,000 patients from the 2005 HCUP data representing three diseases (diabetes, atherosclerosis and hypertension) using two classifiers (random forest and SVM trained on 21,000 samples) show significant (about 10%) improvement as measured by the area under the ROC curve (AROC).
  • Keywords
    diseases; fuzzy set theory; health care; medical computing; ontologies (artificial intelligence); support vector machines; HCUP database; ICD-9 ontological features; MDS database; NHANES database; ROC curve; SVM; atherosclerosis; diabetes; diagnose-related groups; disease prediction; fuzzy membership features; health cost evaluation; health insurance; health reporting evaluation; hypertension; medical diagnoses; patient diagnoses; public health; random forest; tailored health communication; Diabetes; Diseases; Feature extraction; Hypertension; Medical diagnostic imaging; Radio frequency; Support vector machines; ICD-9 similarity measure; SVM; disease prediction; ontological features; random forest;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ), 2011 IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4244-7315-1
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZY.2011.6007410
  • Filename
    6007410