DocumentCode :
243527
Title :
Clustering Ensemble and Application in HST Dataset
Author :
Wenchao Xiao ; Yan Yang ; Hongjun Wang ; Yingge Xu
Author_Institution :
Sch. of Inf. Sci. & Technol., Southwest Jiaotong Univ., Chengdu, China
fYear :
2014
fDate :
14-14 Dec. 2014
Firstpage :
213
Lastpage :
220
Abstract :
Clustering ensemble is an important part of ensemble learning. It aims to study and integrate multiple clustering results from different clustering algorithms or same algorithm with different initial parameters for the same dataset. CHAMELEON is a hierarchical clustering algorithm which can discover natural clusters of different shapes and sizes as the result of its merging decision dynamically adapts to the different clustering model characterized. Inspired by the idea of CHAMELEON, the paper proposes a novel clustering ensemble model including semi-supervised method and discusses its application in fault diagnosis of high speed train (HST) running gear. The model is divided into three phases. Phase 1 is constructing a sparse graph through similarity matrix which aggregates multiple clustering results. Phase 2 is partitioning the sparse graph (vertex = object, edge weight = similarity) into a large number of relatively small sub-clusters. Phase 3 is obtaining the final clustering partition by merging these sub-clusters repeatedly. The experimental results demonstrate that our method out-performs some of state-of-the-art ensemble algorithms regarding the accuracy and stability and recognizes fault patterns of HST running gear effectively.
Keywords :
condition monitoring; fault diagnosis; gears; graph theory; learning (artificial intelligence); matrix algebra; mechanical engineering computing; railways; CHAMELEON; HST running gear; clustering ensemble; ensemble learning; fault diagnosis; hierarchical clustering algorithm; high speed train running gear; semisupervised method; similarity matrix; sparse graph; Clustering algorithms; Educational institutions; Gears; Information science; Merging; Partitioning algorithms; Sparse matrices; CHAMELEON; Clustering Ensemble; Fault Diagnosis; Semi-supervised;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
Type :
conf
DOI :
10.1109/ICDMW.2014.143
Filename :
7022600
Link To Document :
بازگشت