Title :
A novel approach to select significant genes of leukemia cancer data using K-Means clustering
Author :
Palanisamy, P. ; Perumal ; Thangavel, K. ; Manavalan, R.
Author_Institution :
Dept. of Biotechnol., Periyar Univ., Salem, India
Abstract :
DNA microarray technologies are leading to an explosion in available gene expression data which simultaneously monitor the expression pattern of thousands of genes. All the genes may not be biologically significant in diagnosing the disease. In this paper, a novel approach has been proposed to select significant genes of leukemia cancer using K-Means clustering algorithm. It is an unsupervised machine learning approach, which is being used to identify the unknown patterns from the huge amount of data. The proposed K-Means algorithm has been experimented to cluster the genes for K=5,10 and 15. The significant genes have been identified through the best accuracy obtained from the clusters generated. The accuracy of the clusters are determined again by using K-Means algorithm compared with ground truth values.
Keywords :
biology computing; cancer; genetics; molecular biophysics; pattern classification; unsupervised learning; DNA microarray technology; K-means clustering; disease diagnosis; gene expression data; gene selection; ground truth value; leukemia cancer data; unsupervised machine learning approach; Accuracy; Cancer; Clustering algorithms; Data analysis; Diseases; Gene expression; Sensitivity; Accuracy; Clustering; K-Means; Leukemia; Microarray; Sensitivity; Specificity;
Conference_Titel :
Pattern Recognition, Informatics and Mobile Engineering (PRIME), 2013 International Conference on
Conference_Location :
Salem
Print_ISBN :
978-1-4673-5843-9
DOI :
10.1109/ICPRIME.2013.6496455