DocumentCode
2459344
Title
Dynamic Methods for Missing Value Estimation for DNA Sequences
Author
Qin, Fen ; Lee, Jeonghwa
Author_Institution
Shippensburg Univ., Shippensburg, PA, USA
fYear
2010
fDate
17-19 Dec. 2010
Firstpage
442
Lastpage
445
Abstract
Many gene expressions in the DNA micro array and gene sequences have missing values in the datasets. It is critical to estimate these missing values accurately, because most of the existing algorithms for gene expression analysis require the complete DNA dataset as an input, which affects the performance of gene classifications. This paper introduces dynamic local least squares imputation (DLLSimpute), which selects local matrices dynamically and consequently uses more gene information each time to recover the missing values. Numerical results show that the DLLSimpute recovers missing values more accurately than the k-nearest neighboring imputation (KNNimpute) and the local least squares imputation (LLSimpute) regardless of the completeness of the datasets.
Keywords
DNA; biology computing; data analysis; genetics; least squares approximations; DLLSimpute; DNA data analysis algorithms; DNA micro array; DNA sequences; KNNimpute; LLSimpute; dynamic local least squares imputation; gene expressions; gene sequences; k-nearest neighboring imputation; local least squares imputation; missing value estimation; Algorithm design and analysis; DNA; Equations; Estimation; Gene expression; Jacobian matrices; Mathematical model;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational and Information Sciences (ICCIS), 2010 International Conference on
Conference_Location
Chengdu
Print_ISBN
978-1-4244-8814-8
Electronic_ISBN
978-0-7695-4270-6
Type
conf
DOI
10.1109/ICCIS.2010.115
Filename
5709119
Link To Document