• DocumentCode
    2443079
  • Title

    Evolutionary computation techniques for multiple sequence alignment

  • Author

    Cai, Liming ; Juedes, David ; Liakhovitch, Evgueni

  • Author_Institution
    Sch. of Electron. Eng. & Comput. Sci., Ohio Univ., Athens, OH, USA
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    829
  • Abstract
    Given a collection of biologically related protein or DNA sequences, the basic multiple sequence alignment problem is to determine the most biologically plausible alignment of these sequences. Under the assumption that the collection of sequences arose from some common ancestor, an alignment can be used to infer the evolutionary history among the sequences, i.e., the most likely pattern of insertions, deletions and mutations that transformed one sequence into another. The general multiple sequence alignment problem is known to be NP-hard, and hence the problem of finding the best possible multiple sequence alignment is intractable. However, this does not preclude the possibility of developing algorithms that produce near optimal multiple sequence alignments in polynomial time. We examine techniques to combine efficient algorithms for near optimal global and local multiple sequence alignment with evolutionary computation techniques to search for better near optimal sequence alignments. We describe our evolutionary computation approach to multiple sequence alignment and present preliminary simulation results on a set of 17 clusters of orthologous groups of proteins (COGs). We compare the fitness of the alignments given by the proposed techniques with the fitness of CLUSTAL W alignments given in the COG database
  • Keywords
    DNA; biology computing; computational complexity; evolutionary computation; COG database; DNA sequences; NP-hard; biologically related protein sequences; clusters of orthologous groups of proteins; evolutionary computation; multiple sequence alignment; polynomial time; simulation; Clustering algorithms; Computational modeling; DNA; Databases; Evolutionary computation; Genetic mutations; History; Polynomials; Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Evolutionary Computation, 2000. Proceedings of the 2000 Congress on
  • Conference_Location
    La Jolla, CA
  • Print_ISBN
    0-7803-6375-2
  • Type

    conf

  • DOI
    10.1109/CEC.2000.870716
  • Filename
    870716