DocumentCode
140803
Title
Investigation of factors affecting RNA-seq gene expression calls
Author
Harati, Sahar ; Phan, John H. ; Wang, May Dongmei
Author_Institution
Dept. of Biomed. Eng., Emory Univ., Atlanta, GA, USA
fYear
2014
fDate
26-30 Aug. 2014
Firstpage
5232
Lastpage
5235
Abstract
RNA-seq enables quantification of the human transcriptome. Estimation of gene expression is a fundamental issue in the analysis of RNA-seq data. However, there is an inherent ambiguity in distinguishing between genes with very low expression and experimental or transcriptional noise. We conducted an exploratory investigation of some factors that may affect gene expression calls. We observed that the distributions of reads that map to exonic, intronic, and intergenic regions are distinct. These distributions may provide useful insights into the behavior of gene expression noise. Moreover, we observed that these distributions are qualitatively similar between two sequence mapping algorithms. Finally, we examined the relationship between gene length and gene expression calls, and observed that they are correlated. This preliminary investigation is important for RNA-seq gene expression analysis because it may lead to more effective algorithms for distinguishing between true gene expression and experimental or transcriptional noise.
Keywords
RNA; genetics; genomics; RNA-seq data analysis; RNA-seq gene expression calls; exonic regions; gene expression estimation; gene expression noise; gene length; human transcriptome; intergenic regions; intronic regions; sequence mapping algorithms; transcriptional noise; Algorithm design and analysis; Bioinformatics; Estimation; Gene expression; Genomics; Noise; Pipelines
fLanguage
English
Publisher
IEEE
Conference_Titel
Engineering in Medicine and Biology Society (EMBC), 2014 36th Annual International Conference of the IEEE
Conference_Location
Chicago, IL
ISSN
1557-170X
Type
conf
DOI
10.1109/EMBC.2014.6944805
Filename
6944805