• DocumentCode
    2948068
  • Title

    A collateral missing value estimation algorithm for DNA microarrays

  • Author

    Sehgal, Muhammad Shoaib B ; Gondal, Iqbal ; Dooley, Laurence

  • Author_Institution
    Monash Univ., Clayton, Vic., Australia
  • Volume
    5
  • fYear
    2005
  • fDate
    18-23 March 2005
  • Abstract
    Genetic microarray expression data often contains multiple missing values that can significantly affect the performance of statistical and machine learning algorithms. This paper presents an innovative missing value estimation technique, called collateral missing value estimation (CMVE) which has demonstrated superior estimation performance compared with the K-nearest neighbour (KNN) imputation algorithm, the least square impute (LSImpute) and Bayesian principal component analysis (BPCA) techniques. Experimental results confirm that CMVE provides an improvement of 89%, 12% and 10% for the BRCA1, BRCA2 and sporadic ovarian cancer mutations, respectively, compared to the average error rate of KNN, LSImpute and BPCA imputation methods, over a range of randomly selected missing values. The underlying theory behind CMVE also means that it is not restricted to bioinformatics data, but can be successfully applied to any correlated data set.
  • Keywords
    Bayes methods; DNA; biology computing; error statistics; least squares approximations; parameter estimation; principal component analysis; Bayesian principal component analysis; DNA microarrays; K-nearest neighbour imputation algorithm; bioinformatics data; collateral missing value estimation algorithm; correlated data set; genetic microarray expression data; least square impute technique; machine learning algorithms; statistical algorithms; Bayesian methods; Bioinformatics; Cancer; DNA; Error analysis; Genetic mutations; Least squares approximation; Machine learning algorithms; Principal component analysis; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-8874-7
  • Type

    conf

  • DOI
    10.1109/ICASSP.2005.1416319
  • Filename
    1416319