• DocumentCode
    1238389
  • Title

    Predicting the Outcome of Patients With Subarachnoid Hemorrhage Using Machine Learning Techniques

  • Author

    De Toledo, Paula ; Rios, Pablo M. ; Ledezma, Agapito ; Sanchis, Araceli ; Alen, Jose F. ; Lagares, Alfonso

  • Author_Institution
    Control, Learning, & Syst. Optimization Group, Univ. Carlos III de Madrid, Leganes, Spain
  • Volume
    13
  • Issue
    5
  • fYear
    2009
  • Firstpage
    794
  • Lastpage
    801
  • Abstract
    Background: Outcome prediction for subarachnoid hemorrhage (SAH) helps guide care and compare global management strategies. Logistic regression models for outcome prediction may be cumbersome to apply in clinical practice. Objective: To use machine learning techniques to build a model of outcome prediction that makes the knowledge discovered from the data explicit and communicable to domain experts. Material and methods: A derivation cohort (n = 441) of nonselected SAH cases was analyzed using different classification algorithms to generate decision trees and decision rules. Algorithm used were C4.5, fast decision tree learner, partial decision trees, repeated incremental pruning to produce error reduction, nearest neighbor with generalization, and ripple down rule learner. Outcome was dichotomized in favorable [Glasgow outcome scale (GOS) = I-II] and poor (GOS = III-V). An independent cohort ( n = 193) was used for validation. An exploratory questionnaire was given to potential users (specialist doctors) to gather their opinion on the classifier and its usability in clinical routine. Results: The best classifier was obtained with the C4.5 algorithm. It uses only two attributes [World Federation of Neurological Surgeons (WFNS) and Fisher´s scale] and leads to a simple decision tree. The accuracy of the classifier [area under the ROC curve (AUC) = 0.84; confidence interval (CI) = 0.80-0.88] is similar to that obtained by a logistic regression model (AUC = 0.86; CI = 0.83-0.89) derived from the same data and is considered better fit for clinical use.
  • Keywords
    data mining; decision trees; health care; learning (artificial intelligence); medical disorders; medical information systems; neurophysiology; pattern classification; regression analysis; C4.5 algorithm; decision rule; error reduction; fast decision tree learner; health care; knowledge discovery; logistic regression model; machine learning; nearest neighbor algorithm; partial decision tree; patient outcome prediction; pattern classification; repeated incremental pruning; ripple down rule learner; subarachnoid hemorrhage; Data mining; knowledge discovery in databases; machine learning; prognosis; subarachnoid hemorrhage; Adolescent; Adult; Aged; Aged, 80 and over; Algorithms; Artificial Intelligence; Female; Humans; Male; Middle Aged; Predictive Value of Tests; Prognosis; Reproducibility of Results; Subarachnoid Hemorrhage;
  • fLanguage
    English
  • Journal_Title
    Information Technology in Biomedicine, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1089-7771
  • Type

    jour

  • DOI
    10.1109/TITB.2009.2020434
  • Filename
    4814677