DocumentCode
1809302
Title
Discovering non-coding RNA elements in drosophila 3′ untranslated regions
Author
Zhong, Cuncong ; Andrews, Justen ; Zhang, Shaojie
Author_Institution
Dept. of EECS, Univ. of Central Florida, Orlando, FL, USA
fYear
2012
fDate
23-25 Feb. 2012
Firstpage
1
Lastpage
6
Abstract
The non-coding RNA (ncRNA) elements in the 3´ untranslated regions (3´-UTRs) are known to participate in the genes´ post-transcriptional regulation, such as their stability, translation efficiency, and subcellular localization. Inferring co-expression patterns of the genes by clustering their 3´-UTR ncRNA elements will provide invaluable knowledge for further studies of their functionalities and interactions under specific physiological processes. In this work, we propose an improved RNA structural clustering pipeline that takes into account the length-dependent distribution of the structural similarity measure. Benchmark of the proposed pipeline on Rfam data clearly demonstrates over 10% performance gain, compared to a traditional hierarchical clustering pipeline. By applying the proposed clustering pipeline to Drosophila melanogaster´s 3´-UTRs, we have successfully identified 184 ncRNA clusters, of which 91.3% appear to be true RNA structural elements, based on RNAz´s prediction. Among the clusters we have rediscovered the well-known histone ncRNA family as well as a number of other families whose potential functionalities may be inferred from existing studies. One of such families contains genes that are preferentially expressed in male Drosophila. In situ hybridization further reveals their characteristic `cup´ or `comet´ localization patterns in Drosophila testis. The complete clustering results are available at http://genome.ucf.edu/fly3UTRcluster.
Keywords
RNA; biology computing; cellular biophysics; genetics; molecular biophysics; molecular configurations; 3´-UTR ncRNA element; Drosophila 3´ untranslated region; Drosophila melanogaster 3´-UTR; Drosophila testis; RNA structural clustering pipeline; RNA structural element; Rfam data; comet localization pattern; cup localization pattern; gene; gene co-expression pattern; gene post-transcriptional regulation; hierarchical clustering pipeline; histone ncRNA family; in situ hybridization; male Drosophila; ncRNA cluster; noncoding RNA element; physiological process; subcellular localization; translation efficiency; Benchmark testing; Bioinformatics; Clustering algorithms; Clustering methods; Genomics; Pipelines; RNA; 3′ untranslated regions; non-coding RNA element; post transcriptional regulation; structural clustering; testis-specific gene expression;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Advances in Bio and Medical Sciences (ICCABS), 2012 IEEE 2nd International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
978-1-4673-1320-9
Electronic_ISBN
978-1-4673-1319-3
Type
conf
DOI
10.1109/ICCABS.2012.6182650
Filename
6182650
Link To Document