Title :
An Algorithm for Missing Value Estimation for DNA Microarray Data
Author :
Friedland, Shmuel ; Niknejad, Amir ; Kaveh, Mostafa ; Zare, Hossein
Author_Institution :
Dept. of Math., Stat. & Comput. Sci., Illinois Univ., Chicago, IL
Abstract :
Gene expression data matrices often contain missing expression values. In this paper, we describe a new algorithm, named improved fixed rank approximation algorithm (IFRAA), for missing values estimations of the large gene expression data matrices. We compare the present algorithm with the two existing and widely used methods for reconstructing missing entries for DNA microarray gene expression data: the Bayesian principal component analysis (BPCA) and the local least squares imputation method (LLS). The three algorithms were applied to four microarray data sets and two synthetic low-rank data matrices. Certain percentages of the elements of these data sets were randomly deleted, and the three algorithms were used to recover them. In conclusion IFRAA appears to be the most reliable and accurate approach for recovering missing DNA microarray gene expression data, or any other noisy data matrices that are effectively low rank
Keywords :
DNA; least squares approximations; principal component analysis; Bayesian principal component analysis; DNA microarray data; DNA microarray gene expression data; gene expression data matrices; improved fixed rank approximation algorithm; local least squares imputation method; missing value estimation; noisy data matrices; Approximation algorithms; Bayesian methods; Cancer; DNA; Gene expression; Least squares approximation; Least squares methods; Mathematics; Matrix decomposition; Principal component analysis; Bayesian analysis; Gene expression matrix; K-nearest neighbor; least squares; missing values imputation; principal component analysis; singular value decomposition;
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
Print_ISBN :
1-4244-0469-X
DOI :
10.1109/ICASSP.2006.1660537