Title :
A Biologically Verified Classification of Microarray Data
Author :
Mondal, Ritwik ; Mahata, Bholanath ; Dasgupta, Srirupa
Author_Institution :
Dept. of Inf. Technol., Gov. Coll. of Eng. & Ceramic Technol., Kolkata, India
Abstract :
A micro array represents thousands of gene expression levels across a few samples. Determination of an optimal set of features from such a high dimensional dataset requires a good feature selection method. Based on statistical significance of the features, an elimination of insignificant genes can be performed. However such methods lack biological validation. In this paper we propose a method where statistically reduced gene set is biologically verified with the help of Gene ontology (GO). With this verified feature set classification is performed on three micro array datasets using Support vector machine (SVM) and Random forest (RF) classifiers. The classification accuracy determined using same test sets for both without and with ontological verification are found to be significantly improved.
Keywords :
biological techniques; biology computing; feature extraction; feature selection; genetics; lab-on-a-chip; pattern classification; random processes; statistical analysis; support vector machines; RF classifiers; SVM; biological validation; biological verified classification; feature selection; feature set classification; gene expression levels; gene ontology; high dimensional dataset; microarray datasets; random forest classifiers; statistical significance; support vector machine; Accuracy; Bioinformatics; Gene expression; Ontologies; Radio frequency; Support vector machines; feature selection; gene enrichment; gene ontology; microarray data; random forest; support vector machine; t-test;
Conference_Titel :
Computational Intelligence and Communication Networks (CICN), 2014 International Conference on
Print_ISBN :
978-1-4799-6928-9
DOI :
10.1109/CICN.2014.151