DocumentCode :
2112737
Title :
Long intergenic non-coding RNA detection benefited from integrative modeling of (Epi)genomic data
Author :
Jie Lv ; Hongbo Liu ; Qiong Wu
Author_Institution :
Sch. of Life Sci. & Technol., Harbin Inst. of Technol., Harbin, China
fYear :
2013
fDate :
23-25 July 2013
Firstpage :
735
Lastpage :
740
Abstract :
Prediction of long intergenic non-coding RNAs (lincRNAs) is a prerequisite to analyze sequence features of non-coding RNAs and explore their regulatory function. Genomic sequence features provide fundamental backgrounds for lincRNA predictions, due to that sequence information at least partially aids such predictions. However, genomic sequence alone seems to reach an end involving sensitivity for lincRNA prediction in eukaryotes. Chromatin factors leave marks that can be captured by high-throughput approaches such as ChIP-seq are also important features, as revealed by previous studies. We demonstrate that the performance of lincRNA predictions can be improved when incorporating both high-throughput chromatin modification and genomic sequence features by logistic regression with LASSO regularization. The discriminating features include H3K4me1, H3K27ac, H3K9me3, Open Reading Frames and several repeat elements. Importantly, chromatin information is suggested to be complementary to genomic sequence information, highlighting the importance of an integrated model. We also show that the lincRNA expression specificity can be efficiently modeled by the chromatin data with same developmental stage. The study not only supports the biological hypothesis that chromatin factors can regulate developmental-stage-specific expression of lincRNAs, also reveals the discriminating features between lincRNA and coding genes.
Keywords :
biology computing; genomics; molecular biophysics; H3K27ac; H3K4me1; H3K9me3; LASSO regularization; chromatin factors; genomic data; genomic sequence; high-throughput chromatin modification; integrative data modeling; lincRNA developmental-stage-specific expression; lincRNA predictions; logistic regression; long intergenic noncoding RNA detection; open reading frames; regulatory function; sequence information; Bioinformatics; Data models; Encoding; Genomics; Mice; Predictive models; RNA; Long intergenic non-coding RNAs; brain development; chromatin features; integrative model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery (FSKD), 2013 10th International Conference on
Conference_Location :
Shenyang
Type :
conf
DOI :
10.1109/FSKD.2013.6816292
Filename :
6816292
Link To Document :
بازگشت