Title :
Improvement of clustal-derived sequence alignments with evolutionary algorithms
Author :
Thomsen, Rene ; Fogel, Gary B. ; Krink, Thiemo
Author_Institution :
Dept. of Comput. Sci., Aarhus Univ., Denmark
Abstract :
Multiple sequence alignment (MSA) is a central problem in bioinformatics. In this study, we extended previous efforts using evolutionary algorithms (EAs) for MSA. Candidate solutions in the initial population were derived from the well-known alignment program Clustal X. Evolutionary computation was then used to evolve increasingly appropriate solutions. Three new alignment operators were introduced and tested within the framework of protein sequence alignment. Statistics on alignment quality were generated with respect to selected alignment benchmarks from the BAliBASE database using the BLOSUM 62 substitution matrix. Our results indicate the degree to which EAs can enhance the results of Clustal X. Moreover, the experimental results show that the commonly used sum-of-pairs scoring scheme sometimes fails to correlate higher scoring alignments with increase in alignment quality in terms of the BAliBASE sum-of-pairs score.
Keywords :
biology computing; computational complexity; evolutionary computation; matrix algebra; proteins; sequences; BAliBASE database; BAliBASE sum-of-pairs score; BLOSUM 62 substitution matrix; Clustal X; alignment benchmarks; alignment operator; alignment program; alignment quality statistics; bioinformatics; clustal-derived sequence alignment; evolutionary algorithms; evolutionary computation; multiple sequence alignment; protein sequence alignment framework; sum-of-pairs scoring scheme; Benchmark testing; Bioinformatics; Computer science; DNA; Databases; Evolution (biology); Evolutionary computation; Protein sequence; Statistics; Stochastic processes;
Conference_Titel :
Evolutionary Computation, 2003. CEC '03. The 2003 Congress on
Print_ISBN :
0-7803-7804-0
DOI :
10.1109/CEC.2003.1299591