DocumentCode :
1681543
Title :
A parallel architecture for regulatory motif algorithm assessment
Author :
Quest, Daniel ; Dempsey, Kathryn ; Shafiullah, M. ; Bastola, Dhundy ; Ali, Hesham
Author_Institution :
Coll. of Inf. Technol., Univ. of Nebraska at Omaha, Omaha, NE
fYear :
2008
Firstpage :
1
Lastpage :
8
Abstract :
Computational discovery of cis-regulatory motifs has become one of the more challenging problems in bioinformatics. In recent years, over 150 methods have been proposed as solutions; however, it remains difficult to characterize the advantages and disadvantages of these approaches because of the wide variability of approaches and datasets. Although biologists desire a set of parameters and a program most appropriate for cis-regulatory discovery in their domain of interest, compiling such a list is a great computational challenge. First, a discovery pipeline for 150+ methods must be automated, and then each dataset of interest must be used to grade the methods. Automation is challenging because these programs are intended to be used over a small set of sites and consequently have many manual steps intended to help the user in fine-tuning the program to specific problems or organisms. If a program is fine-tuned to parameters other than those used in the original paper, it is not guaranteed to have the same sensitivity and specificity. Consequently, there are few methods that rank motif discovery tools. This paper proposes a parallel framework for the automation and evaluation of cis-regulatory motif discovery tools. This evaluation platform can both run and benchmark motif discovery tools over a wide range of parameters and is the first method to consider both multiple binding locations within a regulatory region and regulatory regions of orthologous genes. Because of the large number of tests required, we implemented this platform on a computing cluster to increase performance.
Keywords :
biology computing; parallel architectures; benchmark motif discovery tools; bioinformatics; biologists; cis-regulatory discovery; computational discovery; computing cluster; discovery pipeline; multiple binding locations; orthologous genes; parallel architecture; rank motif discovery tools; regulatory motif algorithm assessment; Automation; Bioinformatics; Biology computing; Concurrent computing; Educational institutions; Electronic mail; Information technology; Parallel architectures; Pathology; Pipelines;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
Conference_Location :
Miami, FL
ISSN :
1530-2075
Print_ISBN :
978-1-4244-1693-6
Electronic_ISBN :
1530-2075
Type :
conf
DOI :
10.1109/IPDPS.2008.4536178
Filename :
4536178