• DocumentCode
    742237
  • Title

    Predicting Days in Hospital Using Health Insurance Claims

  • Author

    Yang Xie ; Schreier, Gunter ; Chang, David C. W. ; Neubauer, Sandra ; Ying Liu ; Redmond, Stephen J. ; Lovell, Nigel H.

  • Author_Institution
    Grad. Sch. of Biomed. Eng., Univ. of New South Wales, Sydney, NSW, Australia
  • Volume
    19
  • Issue
    4
  • fYear
    2015
  • fDate
    7/1/2015 12:00:00 AM
  • Firstpage
    1224
  • Lastpage
    1233
  • Abstract
    Health-care administrators worldwide are striving to lower the cost of care while improving the quality of care given. Hospitalization is the largest component of health expenditure. Therefore, earlier identification of those at higher risk of being hospitalized would help health-care administrators and health insurers to develop better plans and strategies. In this paper, a method was developed, using large-scale health insurance claims data, to predict the number of hospitalization days in a population. We utilized a regression decision tree algorithm, along with insurance claim data from 242 075 individuals over three years, to provide predictions of number of days in hospital in the third year, based on hospital admissions and procedure claims data. The proposed method performs well in the general population as well as in subpopulations. Results indicate that the proposed model significantly improves predictions over two established baseline methods (predicting a constant number of days for each customer and using the number of days in hospital of the previous year as the forecast for the following year). A reasonable predictive accuracy (AUC $=0.843$) was achieved for the whole population. Analysis of two subpopulations-namely elderly persons aged 63 years or older in 2011 and patients hospitalized for at least one day in the previous year-revealed that the medical information (e.g., diagnosis codes) contributed more to predictions for these two subpopulations, in comparison to the population as a whole.
  • Keywords
    decision trees; geriatrics; health care; insurance data processing; medical information systems; regression analysis; diagnosis codes; elderly persons; health expenditure; health insurance claims; health insurers; health-care administrators; hospitalization days; medical information; regression decision tree algorithm; Australia; Feature extraction; Hospitals; Insurance; Integrated circuits; Medical diagnostic imaging; Predictive models; Australia; big data; health care; health insurance claims; hospitalizations; predictive modeling;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/JBHI.2015.2402692
  • Filename
    7039220