Title :
Handling missing values in data mining - A case study of heart failure dataset
Author :
Poolsawad, N. ; Moore, L. ; Kambhampati, C. ; Cleland, J.G.F.
Author_Institution :
Distrib. Reliable Intell. Syst. Res. Group (DRIS), Univ. of HullHull, Hull, UK
Abstract :
In this paper, we investigate the characteristics of a clinical dataset using feature selection and classification techniques to deal with missing values and develop a method to quantify numerous complexities. The research aims to find features that have high effect on mortality time frame, and to design methodologies which will cope with the following challenges: missing values, high dimensionality, and the prediction problem. The experimental results will be extended to develop prediction model for HF This paper also provides a comprehensive evaluation of a set of diverse machine learning schemes for clinical datasets.
Keywords :
data mining; medical administrative data processing; pattern classification; classification techniques; clinical dataset; data mining; feature selection; heart failure dataset; high dimensionality problem; missing values problem; prediction problem; Accuracy; Artificial neural networks; Classification algorithms; Data mining; Heart; Prediction algorithms; Predictive models; classification; clinical dataset; data mining; feature selection; heart failure; missing values;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2012 9th International Conference on
Conference_Location :
Sichuan
Print_ISBN :
978-1-4673-0025-4
DOI :
10.1109/FSKD.2012.6233860