DocumentCode
454968
Title
An Algorithm for Missing Value Estimation for DNA Microarray Data
Author
Friedland, Shmuel ; Niknejad, Amir ; Kaveh, Mostafa ; Zare, Hossein
Author_Institution
Dept. of Math., Stat. & Comput. Sci., Illinois Univ., Chicago, IL
Volume
2
fYear
2006
fDate
14-19 May 2006
Abstract
Gene expression data matrices often contain missing expression values. In this paper, we describe a new algorithm, named the improved fixed rank approximation algorithm (IFRAA), for estimating missing values in large gene expression data matrices. We compare the present algorithm with two existing and widely used methods for reconstructing missing entries in DNA microarray gene expression data: Bayesian principal component analysis (BPCA) and the local least squares imputation method (LLS). The three algorithms were applied to four microarray data sets and two synthetic low-rank data matrices. Certain percentages of the elements of these data sets were randomly deleted, and the three algorithms were used to recover them. In conclusion, IFRAA appears to be the most reliable and accurate approach for recovering missing DNA microarray gene expression data, or any other noisy data matrices that are effectively low rank.
Keywords
DNA; least squares approximations; principal component analysis; Bayesian principal component analysis; DNA microarray data; DNA microarray gene expression data; gene expression data matrices; improved fixed rank approximation algorithm; local least squares imputation method; missing value estimation; noisy data matrices; Approximation algorithms; Bayesian methods; Cancer; DNA; Gene expression; Least squares approximation; Least squares methods; Mathematics; Matrix decomposition; Principal component analysis; Bayesian analysis; Gene expression matrix; K-nearest neighbor; least squares; missing values imputation; principal component analysis; singular value decomposition
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660537
Filename
1660537