Title :
Empirical investigation of consensus clustering for large ECG data sets
Author :
Kelarev, Andrei ; Stranieri, Andrew ; Yearwood, John ; Jelinek, Herbert
Author_Institution :
Centre for Informatics and Applied Optimization, University of Ballarat, Ballarat, VIC, Australia
Abstract :
This article investigates a novel machine learning approach that applies consensus clustering in conjunction with classification to the data mining of very large, high-dimensional ECG data sets. To obtain robust and stable clusterings, consensus functions can be applied to clustering ensembles that combine a multitude of independent initial clusterings. Direct application of consensus functions to high-dimensional ECG data sets remains computationally expensive and impractical. We introduce a multistage scheme comprising various procedures for dimensionality reduction, consensus clustering of randomized samples, and a final stage that uses a fast supervised classification algorithm. Applying the Hybrid Bipartite Graph Formulation combined with rank ordering and SMO, we obtained an area under the receiver operating characteristic curve of 0.987. The performance of the classification algorithm at the final stage is crucial for the effectiveness of this technique, and it can be regarded as an indication of the reliability, quality and stability of the combined consensus clustering.
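The multistage scheme described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation, and every concrete choice in it is an assumption: the data are synthetic blobs rather than ECG features, PCA stands in for the unspecified dimensionality reduction procedures, k-means supplies the initial clusterings, a simple co-association matrix with average linkage replaces the Hybrid Bipartite Graph Formulation, and scikit-learn's `SVC` stands in for SMO. The function name `consensus_pipeline` and its parameters are hypothetical.

```python
import numpy as np
from scipy.cluster.hierarchy import fcluster, linkage
from scipy.spatial.distance import squareform
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import cross_val_predict
from sklearn.svm import SVC


def consensus_pipeline(X, n_runs=10, sample_size=200, seed=0):
    """Hypothetical three-stage sketch; returns (labels for all rows, AUC)."""
    rng = np.random.default_rng(seed)

    # Stage 1: dimensionality reduction (PCA is an assumed choice).
    Z = PCA(n_components=5, random_state=seed).fit_transform(X)

    # Stage 2: consensus clustering on a randomized sample.
    idx = rng.choice(len(Z), size=sample_size, replace=False)
    S = Z[idx]
    coassoc = np.zeros((sample_size, sample_size))
    for r in range(n_runs):
        # Each run is one independent initial clustering (two clusters).
        run = KMeans(n_clusters=2, n_init=1, random_state=seed + r).fit_predict(S)
        coassoc += run[:, None] == run[None, :]
    coassoc /= n_runs  # fraction of runs in which two points co-cluster

    # Co-association consensus (a stand-in for HBGF): average-linkage
    # clustering on the distance 1 - co-association, cut into two clusters.
    dist = squareform(1.0 - coassoc, checks=False)
    consensus = fcluster(linkage(dist, method="average"), t=2, criterion="maxclust") - 1

    # Stage 3: a fast supervised classifier learns the consensus labels.
    # Its cross-validated AUC serves as an indicator of clustering quality.
    clf = SVC(kernel="linear")
    scores = cross_val_predict(clf, S, consensus, cv=5, method="decision_function")
    auc = roc_auc_score(consensus, scores)

    # The trained classifier then labels the full data set cheaply.
    labels = clf.fit(S, consensus).predict(Z)
    return labels, auc


# Synthetic stand-in data; real ECG features would replace this.
X, _ = make_blobs(n_samples=600, n_features=20, centers=2, random_state=0)
labels, auc = consensus_pipeline(X)
print(f"cross-validated AUC on the clustered sample: {auc:.3f}")
```

The design point this sketch tries to capture is that the expensive consensus step runs only on a small random sample, while the classifier extends the resulting labels to the full data set. The AUC reported here measures how well the classifier reproduces the consensus labels, which is the sense in which the abstract treats classifier performance as an indication of clustering stability.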
Keywords :
data mining; electrocardiography; learning (artificial intelligence); medical signal processing; signal classification; clustering ensembles; consensus clustering; dimensionality reduction; fast supervised classification algorithm; hybrid bipartite graph formulation; independent initial clusterings; large ECG data sets; multistage scheme; novel machine learning approach; randomized samples; receiver operating curve; Accuracy; Classification algorithms; Clustering algorithms; Correlation; Data mining; Electrocardiography; Principal component analysis
Conference_Title :
Computer-Based Medical Systems (CBMS), 2012 25th International Symposium on
Conference_Location :
Rome
Print_ISBN :
978-1-4673-2049-8
DOI :
10.1109/CBMS.2012.6266364