DocumentCode :
2471228
Title :
Improved link-based cluster ensembles for microarray data analysis
Author :
Iam-On, Natthakan ; Boongoen, Tossapon
Author_Institution :
Sch. of Inf. Technol., Mae Fah Luang Univ., Chiang Rai, Thailand
fYear :
2012
fDate :
14-17 Oct. 2012
Firstpage :
2014
Lastpage :
2019
Abstract :
Cancer has been identified as the leading cause of death. It is predicted that around 20-26 million people will be diagnosed with cancer by 2020. As a result, there is an urgent need for a more effective methodology to prevent and cure cancer. Microarray technology provides a useful basis of achieving this ultimate goal. For cancer research, it has become almost routine to create gene expression profiles, which can discriminate patients into good and poor prognosis groups. This cluster analysis offers a useful basis for individualized treatment of disease. Cluster ensembles have been shown to be better than any standard clustering algorithm for such a task. This meta-learning formalism helps users to overcome the dilemma of selecting an appropriate technique and the parameters for that technique, given a set of data. Among different state-of-the-art methods, the link-based approach (LCE) provides a highly accurate clustering. This paper presents the improvement of LCE with a new link-based similarity measure being developed and engaged. Additional information that is already available in an information network is included in the similarity assessment. As such, this refinement can increase the quality of the measures, hence the resulting cluster decision. The performance of this improved LCE is evaluated on published microarray datasets, in comparison with the original LCE and several well-known cluster ensemble techniques. The findings suggest that the new model can improve the accuracy of LCE and performs better than the others investigated in the empirical study.
Keywords :
cancer; data analysis; genetics; learning (artificial intelligence); medical computing; pattern clustering; cancer prevention; cancer research; cluster analysis; cluster decision; disease treatment; gene expression profile; information network; link-based cluster ensemble; link-based similarity measure; meta-learning formalism; microarray data analysis; prognosis group; similarity assessment; Accuracy; Cancer; Clustering algorithms; Clustering methods; Gene expression; Indexes; Standards; cluster ensembles; clustering; link-based similarity; microarray data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location :
Seoul
Print_ISBN :
978-1-4673-1713-9
Electronic_ISBN :
978-1-4673-1712-2
Type :
conf
DOI :
10.1109/ICSMC.2012.6378034
Filename :
6378034
Link To Document :
بازگشت