Title :
Sampling Based Meta-algorithms for Accurate Multiple Sequence Alignment
Author :
Thapar, Vishal ; Rajasekaran, Sanguthevar
Author_Institution :
Dept. of Comput. Sci., Univ. of Connecticut, Storrs, CT
Abstract :
The problem of multiple sequence alignment (MSA) in biology has been studied extensively. In this paper we offer sampling based algorithms for MSA that are more accurate than existing algorithms. The performance of our algorithms has been evaluated using the standard BaliBase dataset [10]. We are able to improve the average alignment SP score for ClustalW [9] and MAFFT [6] by 16.39% and 12.2 %, respectively and the TC score by 55.4% and 46.9%, respectively using sampling. Given that ClustalW and MAFFT are some of the most accurate algorithms for MSA currently and are widely used, our algorithms would be of interest to biologists.
Keywords :
biological techniques; biology computing; sampling methods; sequences; BaliBase dataset; ClustalW algorithm; MAFFT algorithm; Multiple Sequence Alignment; metaalgorithms; sampling; Algorithm design and analysis; Benchmark testing; Bioinformatics; Biology computing; Computer science; Measurement standards; Performance analysis; Runtime; Sampling methods; Sequences;
Conference_Titel :
Bioinformatics and Biomedicine, 2008. BIBM '08. IEEE International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
978-0-7695-3452-7
DOI :
10.1109/BIBM.2008.51