DocumentCode :
3032075
Title :
Missing value imputation methods for gene-sample-time microarray data analysis
Author :
Li, Yifeng ; Ngom, Alioune ; Rueda, Luis
Author_Institution :
Sch. of Comput. Sci., Univ. of Windsor, Windsor, ON, Canada
fYear :
2010
fDate :
2-5 May 2010
Firstpage :
1
Lastpage :
7
Abstract :
With recent advances in microarray technology, the expression levels of genes across samples can be monitored simultaneously over a series of time points. Such three-dimensional microarray data, termed gene-sample-time microarray data (GST data for short), may contain missing values. Current microarray analysis methods require complete data sets, so either every row, column, or tube containing missing values must be removed from the original GST data, or the missing values must be estimated before analysis. Imputation of missing values is, however, generally recommended over removal of data, since it increases the effectiveness of analysis algorithms. In this paper, we extend automated imputation methods devised for two-dimensional microarray data to GST data. We implement imputation methods for GST data based on Singular Value Decomposition (3SVDimpute), K-Nearest Neighbor (3KNNimpute), and gene and sample averages (3Aimpute), and show that the KNN-based methods yield the best results, with the lowest normalized root mean squared error.
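The abstract does not spell out how the KNN-based method or the error measure is computed, so the following Python sketch is only an illustration of the general idea, not the authors' 3KNNimpute. It assumes that the data is a NumPy array of shape (genes, samples, time points) with NaN marking missing values, that each gene's sample-by-time slab is flattened into one vector, that neighbors are ranked by root mean squared difference over the entries both genes have observed, and that the normalized root mean squared error is the RMSE over the imputed entries divided by the standard deviation of the true values at those entries. The function names `knn_impute_gst` and `nrmse` are hypothetical.

```python
import numpy as np


def knn_impute_gst(data, k=3):
    """Fill NaNs in a (genes, samples, time) array from the k nearest genes.

    Assumption: a generic KNN scheme, not the paper's exact algorithm.
    """
    n_genes = data.shape[0]
    flat = data.reshape(n_genes, -1)      # one row per gene
    filled = flat.copy()
    observed = ~np.isnan(flat)

    for g in range(n_genes):
        missing_cols = np.where(~observed[g])[0]
        if missing_cols.size == 0:
            continue

        # Distance to every other gene, using only entries observed in both.
        dists = np.full(n_genes, np.inf)
        for h in range(n_genes):
            if h == g:
                continue
            shared = observed[g] & observed[h]
            if shared.any():
                diff = flat[g, shared] - flat[h, shared]
                dists[h] = np.sqrt(np.mean(diff ** 2))
        order = np.argsort(dists)

        for j in missing_cols:
            # Closest genes that actually have a value at position j.
            donors = [h for h in order
                      if np.isfinite(dists[h]) and observed[h, j]][:k]
            if donors:
                filled[g, j] = flat[donors, j].mean()
            elif observed[g].any():
                # Fallback: the gene's own mean over its observed entries.
                filled[g, j] = flat[g, observed[g]].mean()

    return filled.reshape(data.shape)


def nrmse(true, imputed, missing_mask):
    """RMSE over the imputed entries, divided by the std of the true values.

    Assumption: one common normalization; the paper may define it differently.
    """
    err = imputed[missing_mask] - true[missing_mask]
    return np.sqrt(np.mean(err ** 2)) / np.std(true[missing_mask])
```

A typical evaluation under these assumptions would hide a fraction of the known entries in a complete data set, call `knn_impute_gst` on the masked copy, and pass the original array, the imputed array, and the boolean mask of hidden entries to `nrmse`.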
Keywords :
biology computing; data analysis; molecular biophysics; pattern classification; singular value decomposition; statistical analysis; 2D microarray data; 3Aimpute; 3D microarray data; 3KNNimpute; 3SVDimpute; automated imputation method; gene sample time microarray data analysis; k-nearest neighbor; missing value imputation method; sample average methods; singular value decomposition; Acceleration; Algorithm design and analysis; Data analysis; Gene expression; Genetics; Monitoring; Robustness; Singular value decomposition; Space technology; Tensile stress;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2010 IEEE Symposium on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4244-6766-2
Type :
conf
DOI :
10.1109/CIBCB.2010.5510349
Filename :
5510349