Title :
Workshop: Transcriptome assembly and isoform expression level estimation from biased RNA-Seq reads
Author :
Li, Wei ; Jiang, Tao
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of California, Riverside, CA, USA
Abstract :
Summary form only given: RNA-Seq uses the high-throughput sequencing technology to identify and quantify transcriptome at an unprecedented high resolution and low cost. However, RNA-Seq reads are usually not uniformly distributed and biases in RNA-Seq data post great challenges in many applications including transcriptome assembly and the expression level estimation of genes or isoforms. Much effort has been made in the literature to calibrate the expression level estimation from biased RNA-Seq data, but the effect of biases on transcriptome assembly remains largely unexplored. Here, we propose a statistical framework for both transcriptome assembly and isoform expression level estimation from biased RNA-Seq data. Using a quasi-multinomial distribution model, our method is able to capture various types of RNA-Seq biases, including positional, sequencing and mappability biases. Our experimental resultson simulated and real RNA-Seq datasets exhibit interesting effects of RNA-Seq biases on both transcriptome assembly and isoform expression level estimation. The advantage of our method is clearly shown in the experimental analysis by its high sensitivity and precision in transcriptome assembly and the high concordance of its estimated expression levels with qRT-PCR data.
Keywords :
RNA; biology computing; genetics; molecular biophysics; molecular configurations; RNA-Seq bias; RNA-Seq data; biased RNA-Seq read; gene; isoform expression level estimation; quasi-multinomial distribution model; sequencing technology; transcriptome assembly; Assembly; Bioinformatics; Data models; Educational institutions; Estimation; Genomics; RNA-Seq data analysis; component elimination EM; isoform expression level estimation; read bias correction; transcriptome assembly;
Conference_Titel :
Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4673-1320-9
Electronic_ISBN :
978-1-4673-1319-3
DOI :
10.1109/ICCABS.2012.6182670