Title :
Predicting Days in Hospital Using Health Insurance Claims
Author :
Yang Xie ; Schreier, Gunter ; Chang, David C. W. ; Neubauer, Sandra ; Ying Liu ; Redmond, Stephen J. ; Lovell, Nigel H.
Author_Institution :
Grad. Sch. of Biomed. Eng., Univ. of New South Wales, Sydney, NSW, Australia
fDate :
7/1/2015 12:00:00 AM
Abstract :
Health-care administrators worldwide are striving to lower the cost of care while improving the quality of care given. Hospitalization is the largest component of health expenditure. Therefore, earlier identification of those at higher risk of being hospitalized would help health-care administrators and health insurers to develop better plans and strategies. In this paper, a method was developed, using large-scale health insurance claims data, to predict the number of hospitalization days in a population. We utilized a regression decision tree algorithm, along with insurance claim data from 242 075 individuals over three years, to provide predictions of number of days in hospital in the third year, based on hospital admissions and procedure claims data. The proposed method performs well in the general population as well as in subpopulations. Results indicate that the proposed model significantly improves predictions over two established baseline methods (predicting a constant number of days for each customer and using the number of days in hospital of the previous year as the forecast for the following year). A reasonable predictive accuracy (AUC $=0.843$) was achieved for the whole population. Analysis of two subpopulations-namely elderly persons aged 63 years or older in 2011 and patients hospitalized for at least one day in the previous year-revealed that the medical information (e.g., diagnosis codes) contributed more to predictions for these two subpopulations, in comparison to the population as a whole.
Keywords :
decision trees; geriatrics; health care; insurance data processing; medical information systems; regression analysis; diagnosis codes; elderly persons; health expenditure; health insurance claims; health insurers; health-care administrators; hospitalization days; medical information; regression decision tree algorithm; Australia; Feature extraction; Hospitals; Insurance; Integrated circuits; Medical diagnostic imaging; Predictive models; Australia; big data; health care; health insurance claims; hospitalizations; predictive modeling;
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/JBHI.2015.2402692