DocumentCode :
2465192
Title :
Evaluating Distance Measures for RNA Motif Search
Author :
Schonfeld, Justin ; Ashlock, Daniel
Author_Institution :
Iowa State Univ., Ames
fYear :
0
fDate :
0-0 0
Firstpage :
2331
Lastpage :
2338
Abstract :
This paper extends an earlier study that outlined a bioinformatic pipeline for exploratory search for RNA motifs incorporating both primary and secondary structure. The pipeline is applied to three data sets, one of which is a larger version of the one used in the earlier study. Instead of a single method of estimating the distance between RNA folds, four distance measures were tested. The data sets are: a set of random control sequences, a set of synthetic sequences with simple designed folds, and the iron response element data set, for which actual biological RNA folds are available. The pipeline demonstrates the ability to produce clusters that contain known motifs in the biological data and those designed into the synthetic data. The results for the distance measures vary substantially, and one of the measures, difference in energy, is found to be too simplistic to be useful for differentiating motifs. The other three distance measures all demonstrate some degree of merit. At the heart of the pipeline is a non-linear projection algorithm that uses evolutionary computation to display the intra-RNA-fold distances so that the various distance measures can be visually compared. While the performance of this algorithm is acceptable, suggestions for improving it are made.
Keywords :
biology computing; dynamic programming; evolutionary computation; macromolecules; molecular biophysics; pattern clustering; search problems; sequences; RNA motif search; bioinformatic pipeline; clustering methods; distance measures; dynamic programming algorithm; evolutionary computation; intra-RNA-fold distances; iron response element data set; nonlinear projection algorithm; random control sequences; synthetic sequences; Bioinformatics; Biological control systems; Energy measurement; Evolutionary computation; Heart; Iron; Pipelines; Projection algorithms; RNA; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Evolutionary Computation, 2006. CEC 2006. IEEE Congress on
Conference_Location :
Vancouver, BC
Print_ISBN :
0-7803-9487-9
Type :
conf
DOI :
10.1109/CEC.2006.1688596
Filename :
1688596