DocumentCode
3410908
Title
MUSCLE: multiple sequence alignment with improved accuracy and speed
Author
Edgar, Robert C.
Author_Institution
Dept. of Plant & Microbial Biol., California Univ., Berkeley, CA, USA
fYear
2004
fDate
16-19 Aug. 2004
Firstpage
728
Lastpage
729
Abstract
We present MUSCLE, a new program for creating multiple alignments of protein sequences. MUSCLE achieves the highest scores so far reported on four alignment benchmarks: Balibase, PREFAB, SABmark and SMART, achieving accuracy from 1% to 2.5% higher than T-Coffee and execution times that are generally lower than CLUSTALW for typical input data. With options designed for high-throughput applications, MUSCLE gives average accuracy statistically indistinguishable from T-Coffee and is the fastest published method for large numbers of sequences, able to align 5,000 sequences of length 300 in 7 minutes on a desktop computer. MUSCLE is freely available at http://www.drive5.com/muscle.
Keywords
biology computing; molecular biophysics; proteins; Balibase; CLUSTALW; MUSCLE program; PREFAB; SABmark; SMART; T-Coffee; multiple sequence alignment; protein sequences; Application software; Binary trees; Clustering algorithms; Computational complexity; Frequency; Muscles; Phylogeny; Plants (biology); Proteins; Sequences;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN
0-7695-2194-0
Type
conf
DOI
10.1109/CSB.2004.1332560
Filename
1332560
Link To Document