• DocumentCode
    2998559
  • Title

    Improvement of clustal-derived sequence alignments with evolutionary algorithms

  • Author

    Thomsen, Rene ; Fogel, Gary B. ; Krink, Thiemo

  • Author_Institution
    Dept. of Comput. Sci., Aarhus Univ., Denmark
  • Volume
    1
  • fYear
    2003
  • fDate
    8-12 Dec. 2003
  • Firstpage
    312
  • Abstract
    Multiple sequence alignment (MSA) is a central problem in bioinformatics. In this study, we extended previous efforts using evolutionary algorithms (EAs) for MSA. Candidate solutions in the initial population were derived from the well-known alignment program Clustal X. Evolutionary computation was then used to evolve increasingly appropriate solutions. Three new alignment operators were introduced and tested within the framework of protein sequence alignment. Statistics on alignment quality were generated with respect to selected alignment benchmarks from the BAliBASE database using the BLOSUM 62 substitution matrix. Our results indicate the degree to which EAs can enhance the results of Clustal X. Moreover, the experimental results show that the commonly used sum-of-pairs scoring scheme sometimes fails to correlate higher scoring alignments with increase in alignment quality in terms of the BAliBASE sum-of-pairs score.
  • Keywords
    biology computing; computational complexity; evolutionary computation; matrix algebra; proteins; sequences; BAliBASE database; BAliBASE sum-of-pairs score; BLOSUM 62 substitution matrix; Clustal X; alignment benchmarks; alignment operator; alignment program; alignment quality statistics; bioinformatics; clustal-derived sequence alignment; evolutionary algorithms; evolutionary computation; multiple sequence alignment; protein sequence alignment framework; sum-of-pairs scoring scheme; Benchmark testing; Bioinformatics; Computer science; DNA; Databases; Evolution (biology); Evolutionary computation; Protein sequence; Statistics; Stochastic processes;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2003. CEC '03. The 2003 Congress on
  • Print_ISBN
    0-7803-7804-0
  • Type

    conf

  • DOI
    10.1109/CEC.2003.1299591
  • Filename
    1299591