Title :
Assessing flexible models and rule extraction from censored survival data
Author :
Lisboa, Paulo J G ; Biganzoli, Elia M. ; Taktak, Azzam F. ; Etchells, Terence A. ; Jarman, Ian H. ; Aung, M. S Hane ; Ambrogi, Federico
Author_Institution :
Liverpool John Moores Univ., Liverpool
Abstract :
The evaluation of generic non-linear models for censored data needs to address the two complementary requirements in the software development life-cycle, of validation and verification. The former involves making a rigorous assessment of predictive accuracy in prognostic modelling and the latter is interpreted in this paper as comprising two different stages, namely model selection and rule-based interpretation of the composition of prognostic risk groups. With reference to prognostic performance is survival modelling the well-known ROC framework has recently been extended to a threshold independent, time-dependent performance index to quantify the predictive accuracy of censored data models, termed the C´ index, which is briefly described. The rule-based framework for direct validation of risk group allocation against expert domain knowledge, uses low-order Boolean rules to approximate the response surfaces generated by analytical inference models. In the case of censored data, this approach serves to characterise the allocation of patients into risk groups generated by a risk staging index. Furthermore, the low-order rules define low-dimensional sub-spaces where individual data points can be directly visualised in relation to the decision boundaries for their risk group. Taken together, the quantitative performance index, Boolean explanatory rules and direct visualisation of the data, define a consistent and transparent validation framework based on triangulation of information. This information can be included in decision support systems.
Keywords :
decision support systems; feature extraction; knowledge based systems; response surface methodology; software engineering; censored survival data; decision support systems; expert domain knowledge; flexible models; generic nonlinear models; information triangulation; low-order Boolean rules; model selection; predictive accuracy; prognostic modelling; prognostic risk modelling; response surfaces; risk group allocation; rule extraction; rule-based interpretation; software development life-cycle; Accuracy; Analytical models; Data mining; Data models; Data visualization; Performance analysis; Predictive models; Programming; Response surface methodology; Risk analysis;
Conference_Titel :
Neural Networks, 2007. IJCNN 2007. International Joint Conference on
Conference_Location :
Orlando, FL
Print_ISBN :
978-1-4244-1379-9
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2007.4371207