Title :
QLZCClust: Quaternary Lempel-Ziv complexity based clustering of the RNA-seq read block segments
Author :
Biswas, A.K. ; Baoju Zhang ; Xiaoyong Wu ; Gao, James Xiaoyu
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas at Arlington, Arlington, TX, USA
Abstract :
RNA-seq, a Next Generation Sequencing platform, provides quantitative expression data that exhibit distinctive sequence patterns at the level of short-read segments, and these patterns have been found useful for clustering those segments. However, the resulting clusters do not reflect the functional chemistry of the non-coding RNAs (ncRNAs), whose functions are closely tied to their secondary structures. Clustering the read block segments by their structural profiles rather than their sequence patterns is therefore essential and useful. We propose QLZCClust (Quaternary Lempel-Ziv Complexity based Clustering), a method that extends the popular Lempel-Ziv algorithm to compute pairwise secondary structure distances. We applied QLZCClust to short-read segments obtained from an RNA-seq experiment and found that it can separate most miRNAs from the tRNAs. Moreover, it can be used to detect structural similarities among different classes of ncRNAs. We compared our algorithm with clustering based on two other structural distance measures, SimTree edit distance and RNAz based distance, and found that our method performs better.
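As a rough illustration of the idea of a Lempel-Ziv complexity based pairwise distance, the sketch below counts the components of the classic LZ76 exhaustive parsing of a string and combines the counts into a normalized distance of the kind proposed by Otu and Sayood for sequence comparison. This is a generic stand-in, not the QLZCClust formulation: the abstract does not specify how secondary structures are encoded over four symbols or how the distance is defined, so the four-letter alphabet `ABCD` and the function names `lz_complexity` and `lz_distance` are assumptions made for this example only.

```python
def lz_complexity(s: str) -> int:
    """Number of components in the LZ76 exhaustive parsing of s."""
    i, count, n = 0, 0, len(s)
    while i < n:
        k = 1
        # Grow the candidate component while it can still be copied
        # from the text seen so far (overlap with itself is allowed).
        while i + k <= n and s[i:i + k] in s[:i + k - 1]:
            k += 1
        count += 1  # a final, incomplete component still counts as one
        i += k
    return count


def lz_distance(s: str, q: str) -> float:
    """Normalized LZ distance: how many new components each string
    needs when parsed after the other (assumed form, not the paper's)."""
    if not s or not q:
        raise ValueError("both sequences must be non-empty")
    c_s, c_q = lz_complexity(s), lz_complexity(q)
    extra_q = lz_complexity(s + q) - c_s  # cost of q given s
    extra_s = lz_complexity(q + s) - c_q  # cost of s given q
    return max(extra_q, extra_s) / max(c_s, c_q)


if __name__ == "__main__":
    # Hypothetical structure strings over an assumed 4-symbol alphabet.
    a = "ABBBCDDDABBBCDDD"
    b = "AACCAACCBBDDBBDD"
    print(lz_complexity(a), lz_complexity(b), lz_distance(a, b))
```

A matrix of such pairwise distances could then be passed to any standard hierarchical clustering routine. Note that this particular distance is not exactly zero for identical inputs, which is a known property of the measure.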
Keywords :
RNA; biology computing; pattern clustering; QLZCClust method; RNA-seq read block segments; RNAz based distance; SimTree edit distance; functional chemistry; miRNAs; next generation sequencing platform; noncoding RNAs; pairwise secondary structure distance; quantitative expression data; quaternary Lempel-Ziv complexity based clustering method; read block segments; sequence patterns; short-read segments; structural profiles; structural similarity detection; tRNAs; Clustering algorithms; Complexity theory; Genomics; History; Indexes; Production
Conference_Title :
Bioinformatics and Bioengineering (BIBE), 2013 IEEE 13th International Conference on
Conference_Location :
Chania
DOI :
10.1109/BIBE.2013.6701596