A novel approach to select significant genes of leukemia cancer data using K-Means clustering

Author

Palanisamy, P. ; Perumal ; Thangavel, K. ; Manavalan, R.

Author_Institution

Dept. of Biotechnol., Periyar Univ., Salem, India

fYear

2013

fDate

21-22 Feb. 2013

Firstpage

104

Lastpage

108

Abstract

DNA microarray technologies are leading to an explosion in available gene expression data which simultaneously monitor the expression pattern of thousands of genes. All the genes may not be biologically significant in diagnosing the disease. In this paper, a novel approach has been proposed to select significant genes of leukemia cancer using K-Means clustering algorithm. It is an unsupervised machine learning approach, which is being used to identify the unknown patterns from the huge amount of data. The proposed K-Means algorithm has been experimented to cluster the genes for K=5,10 and 15. The significant genes have been identified through the best accuracy obtained from the clusters generated. The accuracy of the clusters are determined again by using K-Means algorithm compared with ground truth values.

Keywords

biology computing; cancer; genetics; molecular biophysics; pattern classification; unsupervised learning; DNA microarray technology; K-means clustering; disease diagnosis; gene expression data; gene selection; ground truth value; leukemia cancer data; unsupervised machine learning approach; Accuracy; Cancer; Clustering algorithms; Data analysis; Diseases; Gene expression; Sensitivity; Accuracy; Clustering; K-Means; Leukemia; Microarray; Sensitivity; Specificity;

fLanguage

English

Publisher

ieee

Conference_Titel

Pattern Recognition, Informatics and Mobile Engineering (PRIME), 2013 International Conference on

Conference_Location

Salem

Print_ISBN

978-1-4673-5843-9

Type

conf

DOI

10.1109/ICPRIME.2013.6496455

Filename

6496455