Title of article :
Gene selection and classification using Taguchi chaotic binary particle swarm optimization
Author/Authors :
Chuang, Li-Yeh and Yang, Cheng-San and Wu, Kuo-Chuan and Yang, Cheng-Hong
Issue Information :
Journal issue with serial number, year 2011
Pages :
11
From page :
13367
To page :
13377
Abstract :
The purpose of gene expression analysis is to discriminate between classes of samples and to predict the relative importance of each gene for sample classification. Microarray data on gene expression profiles have provided valuable results for a variety of problems and contributed to advances in clinical medicine. Microarray data characteristically have a high dimension and a small sample size, which makes it difficult for a general classification method to classify samples correctly. However, not every gene is relevant for distinguishing the sample class. Thus, in order to analyze gene expression profiles correctly, feature (gene) selection is crucial for the classification process, and an effective gene extraction method is necessary for eliminating irrelevant genes and decreasing the classification error rate. In this paper, correlation-based feature selection (CFS) and Taguchi chaotic binary particle swarm optimization (TCBPSO) were combined into a hybrid method. The K-nearest neighbor (K-NN) method with leave-one-out cross-validation (LOOCV) served as a classifier for ten gene expression profiles. Experimental results show that this hybrid method effectively simplifies feature selection by reducing the number of features needed. The proposed method obtained the lowest classification error rate on all ten gene expression data sets tested. For six of the data sets, a classification error rate of zero was reached. The proposed method outperformed five other methods from the literature in terms of classification error rate. It could thus constitute a valuable tool for gene expression analysis in future studies.
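The evaluation step named in the abstract, K-NN scored by leave-one-out cross-validation on a selected gene subset, can be sketched as follows. This is an illustrative sketch only, not the authors' implementation: the function name `loocv_knn_error`, the Euclidean distance, the handling of an empty subset, and the toy data are all assumptions, and the CFS and TCBPSO search steps are not shown. The binary `mask` stands in for one candidate gene subset of the kind a binary particle swarm would propose.

```python
import math

def loocv_knn_error(samples, labels, mask, k=1):
    """LOOCV error rate of k-NN using only features where mask[j] == 1.

    Hypothetical helper for illustration; not taken from the paper.
    """
    selected = [j for j, bit in enumerate(mask) if bit]
    if not selected:
        return 1.0  # assumption: an empty gene subset counts as worst case
    errors = 0
    for i, (x, y) in enumerate(zip(samples, labels)):
        # Distances from the held-out sample i to every other sample.
        dists = []
        for m, (z, lab) in enumerate(zip(samples, labels)):
            if m == i:
                continue
            d = math.sqrt(sum((x[j] - z[j]) ** 2 for j in selected))
            dists.append((d, lab))
        dists.sort(key=lambda t: t[0])
        # Majority vote among the k nearest neighbours.
        votes = {}
        for _, lab in dists[:k]:
            votes[lab] = votes.get(lab, 0) + 1
        predicted = max(votes, key=votes.get)
        if predicted != y:
            errors += 1
    return errors / len(samples)

# Toy data (made up): feature 0 separates the classes, feature 1 misleads.
samples = [
    [0.0, 5.0, 1.0],
    [0.1, 0.0, 9.0],
    [1.0, 5.1, 9.1],
    [1.1, 0.1, 1.1],
]
labels = ["A", "A", "B", "B"]

print(loocv_knn_error(samples, labels, [1, 0, 0]))  # informative gene only
print(loocv_knn_error(samples, labels, [0, 1, 0]))  # misleading gene only
```

On this toy data the first mask gives an error rate of 0.0 and the second gives 1.0, which shows why the choice of gene subset matters for the classifier. In a wrapper-style search, a score like this would serve as the fitness of each candidate subset.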
Keywords :
Correlation-based feature selection , K-nearest neighbor , Taguchi-binary particle swarm optimization , Microarray data
Journal title :
Expert Systems with Applications
Serial Year :
2011
Record number :
2350410