DocumentCode
2443079
Title
Evolutionary computation techniques for multiple sequence alignment
Author
Cai, Liming ; Juedes, David ; Liakhovitch, Evgueni
Author_Institution
Sch. of Electron. Eng. & Comput. Sci., Ohio Univ., Athens, OH, USA
Volume
2
fYear
2000
fDate
2000
Firstpage
829
Abstract
Given a collection of biologically related protein or DNA sequences, the basic multiple sequence alignment problem is to determine the most biologically plausible alignment of these sequences. Under the assumption that the collection of sequences arose from some common ancestor, an alignment can be used to infer the evolutionary history among the sequences, i.e., the most likely pattern of insertions, deletions, and mutations that transformed one sequence into another. The general multiple sequence alignment problem is known to be NP-hard, and hence the problem of finding the best possible multiple sequence alignment is intractable. However, this does not preclude the possibility of developing algorithms that produce near-optimal multiple sequence alignments in polynomial time. We examine techniques to combine efficient algorithms for near-optimal global and local multiple sequence alignment with evolutionary computation techniques to search for better near-optimal sequence alignments. We describe our evolutionary computation approach to multiple sequence alignment and present preliminary simulation results on a set of 17 clusters of orthologous groups of proteins (COGs). We compare the fitness of the alignments given by the proposed techniques with the fitness of CLUSTAL W alignments given in the COG database.
Keywords
DNA; biology computing; computational complexity; evolutionary computation; COG database; DNA sequences; NP-hard; biologically related protein sequences; clusters of orthologous groups of proteins; multiple sequence alignment; polynomial time; simulation; Clustering algorithms; Computational modeling; Databases; Genetic mutations; History; Polynomials; Proteins; Sequences
fLanguage
English
Publisher
IEEE
Conference_Titel
Evolutionary Computation, 2000. Proceedings of the 2000 Congress on
Conference_Location
La Jolla, CA
Print_ISBN
0-7803-6375-2
Type
conf
DOI
10.1109/CEC.2000.870716
Filename
870716