Title :
Dynamic Methods for Missing Value Estimation for DNA Sequences
Author :
Qin, Fen ; Lee, Jeonghwa
Author_Institution :
Shippensburg Univ., Shippensburg, PA, USA
Abstract :
Many gene expressions in the DNA micro array and gene sequences have missing values in the datasets. It is critical to estimate these missing values accurately, because most of the existing algorithms for gene expression analysis require the complete DNA dataset as an input, which affects the performance of gene classifications. This paper introduces dynamic local least squares imputation (DLLSimpute), which selects local matrices dynamically and consequently uses more gene information each time to recover the missing values. Numerical results show that the DLLSimpute recovers missing values more accurately than the k-nearest neighboring imputation (KNNimpute) and the local least squares imputation (LLSimpute) regardless of the completeness of the datasets.
Keywords :
DNA; biology computing; data analysis; genetics; least squares approximations; DLLSimpute; DNA data analysis algorithms; DNA micro array; DNA sequences; KNNimpute; LLSimpute; dynamic local least squares imputation; gene expressions; gene sequences; k-nearest neighboring imputation; local least squares imputation; missing value estimation; Algorithm design and analysis; DNA; Equations; Estimation; Gene expression; Jacobian matrices; Mathematical model;
Conference_Titel :
Computational and Information Sciences (ICCIS), 2010 International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-8814-8
Electronic_ISBN :
978-0-7695-4270-6
DOI :
10.1109/ICCIS.2010.115