• DocumentCode
    951858
  • Title

    An Evaluation of Information Content as a Metric for the Inference of Putative Conserved Noncoding Regions in DNA Sequences Using a Genetic Algorithms Approach

  • Author

    Congdon, Clare Bates ; Aman, Joseph C. ; Nava, Gerardo M. ; Gaskins, H. Rex ; Mattingly, Carolyn J.

  • Author_Institution
    Univ. of Southern Maine, Portland
  • Volume
    5
  • Issue
    1
  • fYear
    2008
  • Firstpage
    1
  • Lastpage
    14
  • Abstract
    In previous work, we presented GAMI [1], an approach to motif inference that uses a genetic algorithms search. GAMI is designed specifically to find putative conserved regulatory motifs in noncoding regions of divergent species and is designed to allow for analysis of long nucleotide sequences. In this work, we compare GAMI´s performance when run with its original fitness function (a simple count of the number of matches) and when run with information content (IC), as well as several variations on these metrics. Results indicate that IC does not identify highly conserved regions and, thus, is not the appropriate metric for this task, whereas variations on IC, as well as the original metric, succeed in identifying putative conserved regions.
  • Keywords
    DNA; biocomputing; biological techniques; DNA sequences; GAMI; fitness function; genetic algorithms; information content; nucleotide sequences; putative conserved noncoding; Biology and genetics; Evolutionary computing and genetic algorithms; Algorithms; Animals; Base Sequence; Computational Biology; Confidence Intervals; Conserved Sequence; Cystic Fibrosis Transmembrane Conductance Regulator; Enhancer Elements (Genetics); Glutathione Transferase; Humans; Models, Genetic; Regulatory Sequences, Nucleic Acid; Sequence Alignment; Sequence Analysis, DNA; Sex-Determining Region Y Protein;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.1059
  • Filename
    4359854