Title :
Effect of Outlier Removal on Gene Marker Selection Using Support Vector Machines
Author :
Moffitt, Richard ; Phan, John ; Hemby, Scott ; Wang, May
Author_Institution :
Wallace H. Coulter Dept. of Biomed. Eng., Georgia Inst. of Technol., Atlanta, GA
fDate :
2005
Abstract :
Biological markers are useful tools for the diagnosis and prognosis of disease. Many different methods are currently used to extract markers from multiple data sources, including gene expression microarrays. This paper investigates the effect of outlier removal on the performance of one such biomarker selection method, support vector machines (SVM). A simple method of outlier removal is employed as a preprocessing step before the data are used for SVM analysis. Both linear and radial basis kernels are used, along with four different normalization techniques. Results show that outlier removal increases the number of highly predictive genes as well as the number of poorly predicting genes. This result thus supports the use of outlier removal prior to biological marker identification via SVM analysis.
Keywords :
cellular biophysics; genetics; medical diagnostic computing; molecular biophysics; patient diagnosis; support vector machines; biological markers; biomarker selection method; disease diagnosis; disease prognosis; gene expression microarrays; gene marker selection; linear basis kernel; normalization; outlier removal; radial basis kernel; Biomarkers; Biomedical engineering; Data analysis; Diseases; Gene expression; Kernel; Medical diagnostic imaging; Physiology; Support vector machines; Testing
Conference_Title :
Engineering in Medicine and Biology Society, 2005. IEEE-EMBS 2005. 27th Annual International Conference of the
Conference_Location :
Shanghai
Print_ISBN :
0-7803-8741-4
DOI :
10.1109/IEMBS.2005.1616564