• DocumentCode
    3182199
  • Title

    Missing value estimation in microarray data using coregulation and similarity of genes

  • Author

    Paul, Amit ; Sil, Jaya

  • Author_Institution
    Comput. Sci. & Eng. Dept., St. Thomas Coll. of Eng. & Technol., Khidirpore, India
  • fYear
    2011
  • fDate
    11-14 Dec. 2011
  • Firstpage
    705
  • Lastpage
    710
  • Abstract
    Microarray experiments usually generate multiple missing values in gene expression data sets due to several reasons. In the paper, a robust method has been proposed to estimate the missing values in microarray data using biological knowledge of the genes. Missing values are imputed based on the similarity in their characteristics patterns observed among the co-regulated genes. In this approach, first the microarray data are normalized and then the normalized data is discretized to measure similarity between the genes. The estimation accuracy of the proposed method is compared with the existing K-Nearest Neighbor based method and Pattern Similarity Matching (PSM) considering 4000 genes generated from 192 experiments. The experimental results exhibit that the proposed method outperforms other methods in terms of accuracy.
  • Keywords
    DNA; biology computing; genetics; pattern classification; gene coregulation; gene expression data sets; gene similarity; k-nearest neighbor based method; microarray data; microarray experiments; missing value estimation; pattern similarity matching; Arrays; Bioinformatics; Clustering algorithms; Estimation; Gene expression; Oscillators; Coregulation; Microarray data; Oscillation factor; Similarity factor; Similarity probability;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information and Communication Technologies (WICT), 2011 World Congress on
  • Conference_Location
    Mumbai
  • Print_ISBN
    978-1-4673-0127-5
  • Type

    conf

  • DOI
    10.1109/WICT.2011.6141332
  • Filename
    6141332