• DocumentCode
    3046789
  • Title

    High performance computational tools for Motif discovery

  • Author

    Baldwin, N.E. ; Collins, R.L. ; Langston, M.A. ; Symons, C.T. ; Leuze, M.R. ; Voy, B.H.

  • Author_Institution
    Dept. of Comput. Sci., Tennessee Univ., TN, USA
  • fYear
    2004
  • fDate
    26-30 April 2004
  • Firstpage
    192
  • Abstract
    Summary form only given. We highlight a fruitful interplay between biology and computation. The sequencing of complete genomes from multiple organisms has revealed that most differences in organism complexity are due to elements of gene regulation that reside in the non protein coding portions of genes. Both within and between species, transcription factor binding sites and the proteins that recognize them govern the activity of cellular pathways that mediate adaptive responses and survival. Experimental identification of these regulatory elements is by nature a slow process. The availability of complete genomic sequences, however, opens the door for computational methods to predict binding sites and expedite our understanding of gene regulation at a genomic level. Just as with traditional experimental approaches, the computational identification of the molecular factors that control a gene´s expression level has been problematic. As a case in point, the identification of putative motifs is a challenging combinatorial task. For it, powerful new motif finding algorithms and high performance implementations are described. Heavy use is made of graph algorithms, some of which are exceedingly computationally intensive and involve the use of emergent mathematical methods. An approach to fully dynamic load balancing is developed in order to make effective use of highly parallel platforms.
  • Keywords
    biology computing; cellular biophysics; genetics; microorganisms; molecular biophysics; parallel processing; proteins; resource allocation; cellular pathway; computational identification; dynamic load balancing; gene expression level; gene regulation; genome; graph algorithm; high performance computational tool; highly parallel platforms; molecular factor; motif discovery; nonprotein coding portion; transcription factor binding site; Bioinformatics; Computational Intelligence Society; DNA; Gene expression; Genomics; High performance computing; Humans; Mice; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
  • Print_ISBN
    0-7695-2132-0
  • Type

    conf

  • DOI
    10.1109/IPDPS.2004.1303210
  • Filename
    1303210