DocumentCode
2998559
Title
Improvement of Clustal-derived sequence alignments with evolutionary algorithms
Author
Thomsen, Rene ; Fogel, Gary B. ; Krink, Thiemo
Author_Institution
Dept. of Comput. Sci., Aarhus Univ., Denmark
Volume
1
fYear
2003
fDate
8-12 Dec. 2003
Firstpage
312
Abstract
Multiple sequence alignment (MSA) is a central problem in bioinformatics. In this study, we extended previous efforts using evolutionary algorithms (EAs) for MSA. Candidate solutions in the initial population were derived from the well-known alignment program Clustal X. Evolutionary computation was then used to evolve increasingly appropriate solutions. Three new alignment operators were introduced and tested within the framework of protein sequence alignment. Statistics on alignment quality were generated with respect to selected alignment benchmarks from the BAliBASE database using the BLOSUM 62 substitution matrix. Our results indicate the degree to which EAs can enhance the results of Clustal X. Moreover, the experimental results show that the commonly used sum-of-pairs scoring scheme sometimes fails to correlate higher-scoring alignments with increased alignment quality as measured by the BAliBASE sum-of-pairs score.
Keywords
biology computing; computational complexity; evolutionary computation; matrix algebra; proteins; sequences; BAliBASE database; BAliBASE sum-of-pairs score; BLOSUM 62 substitution matrix; Clustal X; alignment benchmarks; alignment operator; alignment program; alignment quality statistics; bioinformatics; clustal-derived sequence alignment; evolutionary algorithms; evolutionary computation; multiple sequence alignment; protein sequence alignment framework; sum-of-pairs scoring scheme; Benchmark testing; Bioinformatics; Computer science; DNA; Databases; Evolution (biology); Evolutionary computation; Protein sequence; Statistics; Stochastic processes
fLanguage
English
Publisher
IEEE
Conference_Titel
Evolutionary Computation, 2003. CEC '03. The 2003 Congress on
Print_ISBN
0-7803-7804-0
Type
conf
DOI
10.1109/CEC.2003.1299591
Filename
1299591