Title :
Partial Least Squares Based Dimension Reduction with Gene Selection for Tumor Classification
Author :
Li, Guo-Zheng ; Zeng, Xue-Qiang ; Yang, Jack Y. ; Yang, Mary Qu
Author_Institution :
Shanghai Univ., Shanghai
Abstract :
Analyzing gene expression data from DNA microarrays by commonly used classifiers is a hard task, be-cause there are only a few observations but with thousands of measured genes in the data set. Partial least squares based dimension reduction (PLSDR) is superior to handling such high dimensional problem, but irrelevant features will introduce errors into the dimension reduction process and reduce the classification accuracy of learning machines. Here feature selection is applied to filter the data and an algorithm named PLSDRg is described by integrating PLSDR with gene selection, which can effectively improve classification accuracy of learning machines. Feature selection is performed by the indication of t-statistics scores on standardized probes. Experimental results on seven microarray data sets show that the proposed method PLSDRg is effective and reliable to improve the generalization performance of classifiers.
Keywords :
DNA; cancer; feature extraction; genetics; learning (artificial intelligence); medical computing; molecular biophysics; pattern classification; statistical analysis; tumours; DNA microarray; PLSDRg algorithm; feature selection; gene expression; gene selection; machine learning; partial least squares based dimension reduction; t-statistics; tumor classification; Bioinformatics; Computer science; DNA; Gene expression; Genomics; Humans; Least squares methods; Machine learning; Medical diagnostic imaging; Neoplasms; Dimension Reduction; Gene Selection; Partial Least Squares;
Conference_Titel :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
DOI :
10.1109/BIBE.2007.4375763