DocumentCode
742237
Title
Predicting Days in Hospital Using Health Insurance Claims
Author
Yang Xie ; Schreier, Gunter ; Chang, David C. W. ; Neubauer, Sandra ; Ying Liu ; Redmond, Stephen J. ; Lovell, Nigel H.
Author_Institution
Grad. Sch. of Biomed. Eng., Univ. of New South Wales, Sydney, NSW, Australia
Volume
19
Issue
4
fYear
2015
fDate
7/1/2015 12:00:00 AM
Firstpage
1224
Lastpage
1233
Abstract
Health-care administrators worldwide are striving to lower the cost of care while improving the quality of care given. Hospitalization is the largest component of health expenditure. Therefore, earlier identification of those at higher risk of being hospitalized would help health-care administrators and health insurers to develop better plans and strategies. In this paper, a method was developed, using large-scale health insurance claims data, to predict the number of hospitalization days in a population. We utilized a regression decision tree algorithm, along with insurance claim data from 242 075 individuals over three years, to provide predictions of number of days in hospital in the third year, based on hospital admissions and procedure claims data. The proposed method performs well in the general population as well as in subpopulations. Results indicate that the proposed model significantly improves predictions over two established baseline methods (predicting a constant number of days for each customer and using the number of days in hospital of the previous year as the forecast for the following year). A reasonable predictive accuracy (AUC $=0.843$) was achieved for the whole population. Analysis of two subpopulations-namely elderly persons aged 63 years or older in 2011 and patients hospitalized for at least one day in the previous year-revealed that the medical information (e.g., diagnosis codes) contributed more to predictions for these two subpopulations, in comparison to the population as a whole.
Keywords
decision trees; geriatrics; health care; insurance data processing; medical information systems; regression analysis; diagnosis codes; elderly persons; health expenditure; health insurance claims; health insurers; health-care administrators; hospitalization days; medical information; regression decision tree algorithm; Australia; Feature extraction; Hospitals; Insurance; Integrated circuits; Medical diagnostic imaging; Predictive models; Australia; big data; health care; health insurance claims; hospitalizations; predictive modeling;
fLanguage
English
Journal_Title
Biomedical and Health Informatics, IEEE Journal of
Publisher
ieee
ISSN
2168-2194
Type
jour
DOI
10.1109/JBHI.2015.2402692
Filename
7039220
Link To Document