Title :
Fuzzy Preference Based Feature Selection and Semisupervised SVM for Cancer Classification
Author :
Maulik, Ujjwal ; Chakraborty, Debasis
Author_Institution :
Dept. of Comput. Sci. & Eng., Jadavpur Univ., Kolkata, India
Abstract :
DNA microarray data now permit scientists to screen thousand of genes simultaneously and determine whether those genes are active or silent in normal and cancerous tissues. With the advancement of microarray technology, new analytical methods must be developed to find out whether microarray data have discriminative signatures of gene expression over normal or cancerous tissues. In this paper, we attempt a prediction scheme that combines fuzzy preference based rough set (FPRS) method for feature (gene) selection with semisupervised SVMs. To show the effectiveness of the proposed approach, we compare the performance of this technique with the signal-to-noise ratio (SNR) and consistency based feature selection (CBFS) methods. Using six benchmark gene microarray datasets (including both binary and multi-class classification problems), we demonstrate experimentally that our proposed scheme can achieve significant empirical success and is biologically relevant for cancer diagnosis and drug discovery.
Keywords :
bioinformatics; cancer; feature selection; fuzzy set theory; genetics; lab-on-a-chip; pattern classification; support vector machines; DNA microarray data; cancer classification; cancer diagnosis; consistency based feature selection methods; drug discovery; feature gene selection; fuzzy preference based feature selection; gene microarray datasets; semisupervised SVM; signal-to-noise ratio; support vector machines; Accuracy; Approximation methods; Cancer; Nanobioscience; Support vector machines; Training; Tumors; Cancer classification; fuzzy preference based rough set; gene selection; microarray cancer data; semisupervised learning; transductive support vector machines;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2014.2312132