• DocumentCode
    3165371
  • Title

    Cox Regression with Correlation Based Regularization for Electronic Health Records

  • Author

    Vinzamuri, Bhanukiran ; Reddy, C.K.

  • Author_Institution
    Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
  • fYear
    2013
  • fDate
    7-10 Dec. 2013
  • Firstpage
    757
  • Lastpage
    766
  • Abstract
    Survival Regression models play a vital role in analyzing time-to-event data in many practical applications ranging from engineering to economics to healthcare. These models are ideal for prediction in complex data problems where the response is a time-to-event variable. An event is defined as the occurrence of a specific event of interest such as a chronic health condition. Cox regression is one of the most popular survival regression model used in such applications. However, these models have the tendency to over fit the data which is not desirable for healthcare applications because it limits their generalization to other hospital scenarios. In this paper, we address these challenges for the cox regression model. We combine two unique correlation based regularizers with cox regression to handle correlated and grouped features which are commonly seen in many practical problems. The proposed optimization problems are solved efficiently using cyclic coordinate descent and Alternate Direction Method of Multipliers algorithms. We conduct experimental analysis on the performance of these algorithms over several synthetic datasets and electronic health records (EHR) data about heart failure diagnosed patients from a hospital. We demonstrate through our experiments that these regularizers effectively enhance the ability of cox regression to handle correlated features. In addition, we extensively compare our results with other regularized linear and logistic regression algorithms. We validate the goodness of the features selected by these regularized cox regression models using the biomedical literature and different feature selection algorithms.
  • Keywords
    cardiology; data analysis; electronic health records; feature selection; hospitals; optimisation; patient diagnosis; regression analysis; EHR data; biomedical literature; chronic health condition; correlation based regularization; cox regression model; cyclic coordinate descent; electronic health records; feature selection algorithm; heart failure diagnosed patients; hospital; logistic regression algorithm; multipliers algorithm alternate direction method; optimization problems; regularized linear regression algorithm; survival regression models; synthetic datasets; time-to-event data analysis; Algorithm design and analysis; Biological system modeling; Equations; Hazards; Kernel; Medical services; Vectors; cox regression; feature selection; healthcare; regularization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2013 IEEE 13th International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2013.89
  • Filename
    6729560