Title of article :
Direct integration of microarrays for selecting informative genes and phenotype classification
Author/Authors :
Youngmi Yoon, Jongchan Lee, Sanghyun Park, Sangjay Bien, Hyun Cheol Chung, Sun Young Rha
Issue Information :
Journal issue with serial number, year 2008
Pages :
18
From page :
88
To page :
105
Abstract :
The ability to provide thousands of gene expression values simultaneously makes microarray data very useful for phenotype classification. A major constraint in phenotype classification is that the number of genes greatly exceeds the number of samples. We overcame this constraint in two ways; we increased the number of samples by integrating independently generated microarrays that had been designed with the same biological objectives, and reduced the number of genes involved in the classification by selecting a small set of informative genes. We were able to maximally use the abundant microarray data that is being stockpiled by thousands of different research groups while improving classification accuracy. Our goal is to implement a feature (gene) selection method that can be applicable to integrated microarrays as well as to build a highly accurate classifier that permits straightforward biological interpretation. In this paper, we propose a two-stage approach. Firstly, we performed a direct integration of individual microarrays by transforming an expression value into a rank value within a sample and identified informative genes by calculating the number of swaps to reach a perfectly split sequence. Secondly, we built a classifier which is a parameter-free ensemble method using only the pre-selected informative genes. By using our classifier that was derived from large, integrated microarray sample datasets, we achieved high accuracy, sensitivity, and specificity in the classification of an independent test dataset.
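The abstract names the two selection steps (rank transformation within a sample, then counting swaps to reach a perfectly split sequence) without giving the details. The following is only an illustrative sketch under assumptions that are not in the record: two classes labeled 0 and 1, swaps taken to mean adjacent swaps, and hypothetical function names. It is not the authors' implementation.

```python
from typing import List, Sequence


def rank_within_sample(values: Sequence[float]) -> List[int]:
    """Replace each expression value by its rank within one sample (1 = smallest)."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        ranks[idx] = rank
    return ranks


def swaps_to_split(labels: Sequence[int]) -> int:
    """Minimum number of adjacent swaps that turns a 0/1 label sequence
    into a perfectly split one (all of one class, then all of the other)."""
    ones_seen = 0
    swaps_zeros_first = 0  # pairs where a 1 precedes a 0
    for lab in labels:
        if lab == 1:
            ones_seen += 1
        else:
            swaps_zeros_first += ones_seen
    n1 = ones_seen
    n0 = len(labels) - n1
    swaps_ones_first = n0 * n1 - swaps_zeros_first  # pairs where a 0 precedes a 1
    return min(swaps_zeros_first, swaps_ones_first)


def gene_swap_score(gene_ranks: Sequence[int], labels: Sequence[int]) -> int:
    """Order the samples by one gene's rank value and count the swaps needed
    to split the resulting class-label sequence (assumed: lower = more informative)."""
    order = sorted(range(len(labels)), key=lambda s: gene_ranks[s])
    return swaps_to_split([labels[s] for s in order])
```

Because ranks are computed inside each sample, samples from independently generated microarrays can be pooled without comparing raw expression scales; under the assumptions above, a gene whose score is 0 separates the two classes perfectly.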
Keywords :
DATA MINING , Microarray data analysis , Microarray data classification , Informative gene selection , Microarray data integration
Journal title :
Information Sciences
Serial Year :
2008
Record number :
1212143