DocumentCode
667258
Title
QLZCClust: Quaternary lempel-Ziv complexity based clustering of the RNA-seq read block segments
Author
Biswas, A.K. ; Baoju Zhang ; Xiaoyong Wu ; Gao, James Xiaoyu
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
fYear
2013
fDate
10-13 Nov. 2013
Firstpage
1
Lastpage
4
Abstract
The Next Generation Sequencing platform, RNA-seq provides quantitative expression data that exhibit distinctive sequence patterns in the segments of the short-reads level and are found useful in clustering of those segments. However, the result does not reflect the functional chemistry of the non-coding RNAs (ncRNAs). The functions of the ncRNAs are deeply related to their secondary structures. Thus by exploring the clustering in terms of structural profiles of the read block segments rather than their sequence patterns would be essential and useful. We proposed the QLZCClust (Quaternary Lempel-Ziv complexity based Clustering) method which is an extension to the popular Lempel-Ziv algorithm to compute pairwise secondary structure distance. We applied QLZCClust on the short-read segments obtained from the RNA-seq experient and found that it can separate most miRNAs and the tRNAs. Moreover, it can be used to detect structural similarities among different classes of ncRNAs. We compared our algorithm with the clustering of two other structural distance measures - SimTree edit distance and RNAz based distance, and found that our method performs superior.
Keywords
RNA; biology computing; pattern clustering; QLZCClust method; RNA-seq read block segments; RNAz based distance; SimTree edit distance; functional chemistry; miRNAs; next generation sequencing platform; noncoding RNAs; pairwise secondary structure distance; quantitative expression data; quaternary Lempel-Ziv complexity based clustering method; read block segments; sequence patterns; short-read segments; structural profiles; structural similarity detection; tRNAs; Clustering algorithms; Complexity theory; Genomics; History; Indexes; Production; RNA;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location
Chania
Type
conf
DOI
10.1109/BIBE.2013.6701596
Filename
6701596
Link To Document