DocumentCode :
2890550
Title :
Prediction and Evaluation of miRNA -- Target Gene Pairs Using K-means Clustering and Bipartite Graphs with Statistical Scoring
Author :
McCormick, Kevin ; Liao, Li
Author_Institution :
Univ. of Delaware, Newark, DE, USA
fYear :
2011
fDate :
12-15 Nov. 2011
Firstpage :
273
Lastpage :
277
Abstract :
Identifying microRNAs (miRNAs) and their target genes plays an increasingly important role in better understanding the regulatory activities in the cell. Most computational methods focus on the sequence complementarity between miRNAs and target genes without using actual expression data, which, even when used, has been primarily just for validation of the predicted relationship between specific miRNAs and genes. Recent findings have shown that many targets are missed by sequence-based approaches. In this work, we present a robust method to predict and evaluate miRNA-gene pairs based on their positional (time-course) expression data from next-generation sequencing and DNA microarray. The method first uses K-means clustering to group miRNAs and genes respectively, and then assigns miRNA-gene pairs to a bipartite graph with statistical scoring. The method is tested by ten-fold cross validation on two datasets in Arabidopsis, achieving a performance of about 0.70 ROC score.
Keywords :
RNA; biology computing; graph theory; lab-on-a-chip; pattern clustering; statistical analysis; Arabidopsis; DNA microarray; bipartite graphs; cell regulatory activities; k-means clustering; miRNA target gene pairs; microRNA identification; next-generation sequencing; positional expression data; sequence complementarity; sequence-based approach; statistical scoring; ten-fold cross validation; Bioinformatics; Bipartite graph; Computational modeling; Educational institutions; Genomics; Testing; K-means clustering; bipartite graph; miRNA target prediction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1799-4
Type :
conf
DOI :
10.1109/BIBM.2011.117
Filename :
6120450
Link To Document :
بازگشت