DocumentCode :
1640345
Title :
Combining singular value decomposition and t-test into hybrid approach for significant gene extraction from microarray data
Author :
Alshalalfa, Mohammed ; Alhajj, Reda ; Rokne, Jon
Author_Institution :
Dept. of Comput. Sci., Univ. of Calgary, Calgary, AB
fYear :
2008
Firstpage :
1
Lastpage :
6
Abstract :
Significant gene extraction from microarray data is a challenging problem which is of great interest to researchers in Computational Biology, Medicine, Computer Science and Statistics. A number of methods have been proposed for extracting the smallest number of genes which can accurately classify different samples. Most of these methods ignore the fact that microarray data is mostly noisy. For instance, only using a statistical t-test has been shown to be insufficient since it result in a high false discovery rate. Recently, a singular value decomposition (SVD) based approach was proposed for time series microarray data reduction, however it turned out not to be efficient for classifying microarray data. To overcome the shortcomings of these approaches, this paper proposes two methods to reduce false discovery rates. The first method involves an iterative t-test which finds the p-value for each gene under perturbation by eliminating one sample at a time. It eliminates weak noisy genes by dropping any gene which does not show significant p-value under all the conditions. The second method is a hybrid process which adapts a combination of the SVD and the t-test. It considers the entropy of all the data, and thus takes the correlation between genes into account. Classification accuracy is used to validate the significance of the extracted genes. The reported test results on two datasets demonstrate the applicability and effectiveness of the two proposed methods.
Keywords :
bioinformatics; entropy; genomics; iterative methods; singular value decomposition; statistical analysis; classification accuracy; data entropy; false discovery rate reduction; gene p-value; iterative t-test; microarray data gene extraction; singular value decomposition; statistical t-test; time series microarray data reduction; Association rules; Computer science; Data analysis; Data mining; Entropy; Gene expression; Singular value decomposition; Support vector machine classification; Support vector machines; Testing; entropy; gene extraction; gene reduction; microarray data; p-value; singular value decomposition; t-test;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
BioInformatics and BioEngineering, 2008. BIBE 2008. 8th IEEE International Conference on
Conference_Location :
Athens
Print_ISBN :
978-1-4244-2844-1
Electronic_ISBN :
978-1-4244-2845-8
Type :
conf
DOI :
10.1109/BIBE.2008.4696691
Filename :
4696691
Link To Document :
بازگشت