DocumentCode :
1784776
Title :
Cluster center genes as candidate biomarkers for the classification of Leukemia
Author :
Dela Rosa, Jonnel L. ; Magpantay, Alvin Edwin A. ; Gonzaga, Alex C. ; Solano, Geoffrey A.
Author_Institution :
Dept. of Phys. Sci. & Math., Univ. of the Philippines, Manila, Philippines
fYear :
2014
fDate :
7-9 July 2014
Firstpage :
124
Lastpage :
129
Abstract :
Modern technologies such as DNA microarray have been developed to study the transcriptome of cancer cells. It has been used in many studies for tumor classification and of identification of marker genes associated with cancer. However, this technique often suffers the `curse of dimensionality´. A general approach to overcome this setback is to perform feature selection technique prior to classification. Biomarkers have long been used for the prognosis and diagnosis of different types of diseases. The need for new and more specific biomarkers for leukemia arises. In this study gene selection was approached first using gene filtering by determining the expressions inter-quartile ranges (IQR) of the genes and determining whether or not they are differentially expressed across the different sample types by using the Kruskal-Wallis analysis of variance (ANOVA). Filtered genes were then subjected to k-means clustering algorithm to identify candidate genes that can be used to discriminate the four main types of leukemia (ALL, AML, CLL, CML) and non-leukemia (NoL) bone marrow samples. The selected genes were then used to build classification models using Support Vector Machine (SVM) and Artificial Neural Network (ANN) learning algorithms. Forty samples were used to build the models and 20 samples were used to assess the models performance. A minimum of 6 genes was found to be needed to correctly classify all samples in the training dataset into the five categories and to classify the samples in the validation dataset with high accuracy.
Keywords :
biomedical equipment; bone; cancer; cellular biophysics; feature selection; filtering theory; genetics; lab-on-a-chip; learning (artificial intelligence); medical diagnostic computing; neural nets; patient diagnosis; pattern classification; pattern clustering; statistical analysis; support vector machines; tumours; ALL; AML; ANN learning algorithms; ANOVA; CLL; CML; DNA microarray; IQR; Kruskal-Wallis analysis-of-variance; SVM; artificial neural network learning algorithms; biomarkers; cancer cells; cluster center genes; diseases diagnosis; diseases prognosis; feature selection; filtered genes; gene filtering; gene selection; inter-quartile ranges; k-means clustering algorithm; leukemia classification; nonleukemia bone marrow samples; support vector machine; training dataset; transcriptome; tumor classification; Analytical models; Artificial neural networks; Biological system modeling; DNA; Indexes; Predictive models; Training; ANN; SVM; biomarkers; leukemia; microarray; transcriptome;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information, Intelligence, Systems and Applications, IISA 2014, The 5th International Conference on
Conference_Location :
Chania
Type :
conf
DOI :
10.1109/IISA.2014.6878769
Filename :
6878769
Link To Document :
بازگشت