• DocumentCode
    3409855
  • Title

    Imputation of missing values in DNA microarray gene expression data

  • Author

    Kim, Hyunsoo ; Golub, Gene H. ; Park, Haesun

  • Author_Institution
    Minnesota Univ., Minneapolis, MN, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    572
  • Lastpage
    573
  • Abstract
    Most multivariate statistical methods for gene expression data require a complete matrix of gene array values. In this paper, an imputation method based on least squares formulation is proposed to estimate missing values. It exploits local similarity structures in the data as well as least squares optimization process. The proposed local least squares imputation method (LLSimpute) represents a target gene that has missing values as a linear combination of similar genes. This algorithm showed better performance than the other imputation methods such as k-nearest neighbor imputation and an imputation method based on Bayesian principal component analysis.
  • Keywords
    DNA; biology computing; genetics; least squares approximations; molecular biophysics; optimisation; statistical analysis; Bayesian principal component analysis; DNA microarray gene expression data; k-nearest neighbor imputation; least squares optimization; local least squares imputation method; local similarity structures; missing values estimation; multivariate statistical methods; Bayesian methods; Computer science; DNA; Gene expression; Image resolution; Laboratories; Least squares approximation; Least squares methods; Principal component analysis; Statistical analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332500
  • Filename
    1332500