• DocumentCode
    591263
  • Title

    Robust prediction of patient mortality from 48 hour intensive care unit data

  • Author

    Di Marco, Luigi Yuri ; Bojarnejad, Marjan ; King, S.T. ; Wenfeng Duan ; Di Maria, Costanzo ; Dingchang Zheng ; Murray, A. ; Langley, Philip

  • Author_Institution
    Newcastle Univ., Newcastle upon Tyne, UK
  • fYear
    2012
  • fDate
    9-12 Sept. 2012
  • Firstpage
    477
  • Lastpage
    480
  • Abstract
    The aim of this study was to develop a new algorithm to predict individual patient mortality with improved accuracy with respect to established methods from data collected over the first 48 hours of admission to the Intensive Care Unit. A binary classifier was developed to participate in Event 1 of the PhysioNet/Computing in Cardiology Challenge 2012. The algorithm development was undertaken using only posterior knowledge from the training dataset (Set-A), containing 41 demographic and clinical variables from 4000 ICU patients. For each variable a feature was defined as the average (across all available measurements of the given variable) likelihood of being part of the “survivors” group. To select features with highest discrimination ability (“survivors” vs. “non-survivors”), a forward sequential selection criterion with logistic cost function was adopted and repeated for cross-validation on N (=10) “leave Mout” (M=50%) random partitions of Set-A. Features that were selected in more than one partition were considered (#Feat = 32). A logistic regression model was used for classification. The score was defined as the lowest between sensitivity and positive predictive value in classification. The proposed method scored 54.9% on Set-A and 44.0% on the test set (Set-B), outperforming the established method SAPS-I (29.6% on Set-A, 31.7% on Set-B).
  • Keywords
    Internet; feature extraction; medical information systems; regression analysis; PhysioNet-computing; SAPS-I method; binary classifier; cardiology challenge 2012; clinical variables; data collection; demographic variables; forward sequential selection criterion; individual patient mortality; intensive care unit data; logistic cost function; logistic regression model; positive predictive value; posterior knowledge; sensitivity value; survivor group; test set; time 48 h; training dataset; Accuracy; Classification algorithms; Hospitals; Logistics; Physiology; Prediction algorithms; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing in Cardiology (CinC), 2012
  • Conference_Location
    Krakow
  • ISSN
    2325-8861
  • Print_ISBN
    978-1-4673-2076-4
  • Type

    conf

  • Filename
    6420434