• DocumentCode
    3685422
  • Title

    Effect of low-expression gene filtering on detection of differentially expressed genes in RNA-seq data

  • Author

    Ying Sha;John H. Phan;May D. Wang

  • Author_Institution
    School of Biology, Georgia Institute of Technology, Atlanta, USA
  • fYear
    2015
  • Firstpage
    6461
  • Lastpage
    6464
  • Abstract
    We compare methods for filtering RNA-seq lowexpression genes and investigate the effect of filtering on detection of differentially expressed genes (DEGs). Although RNA-seq technology has improved the dynamic range of gene expression quantification, low-expression genes may be indistinguishable from sampling noise. The presence of noisy, low-expression genes can decrease the sensitivity of detecting DEGs. Thus, identification and filtering of these low-expression genes may improve DEG detection sensitivity. Using the SEQC benchmark dataset, we investigate the effect of different filtering methods on DEG detection sensitivity. Moreover, we investigate the effect of RNA-seq pipelines on optimal filtering thresholds. Results indicate that the filtering threshold that maximizes the total number of DEGs closely corresponds to the threshold that maximizes DEG detection sensitivity. Transcriptome reference annotation, expression quantification method, and DEG detection method are statistically significant RNA-seq pipeline factors that affect the optimal filtering threshold.
  • Keywords
    "Filtering","Pipelines","Bioinformatics","Sensitivity","Yttrium","Gene expression","Genomics"
  • Publisher
    ieee
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2015 37th Annual International Conference of the IEEE
  • ISSN
    1094-687X
  • Electronic_ISBN
    1558-4615
  • Type

    conf

  • DOI
    10.1109/EMBC.2015.7319872
  • Filename
    7319872