• DocumentCode
    3410908
  • Title

    MUSCLE: multiple sequence alignment with improved accuracy and speed

  • Author

    Edgar, Robert C.

  • Author_Institution
    Dept. of Plant & Microbial Biol., California Univ., Berkeley, CA, USA
  • fYear
    2004
  • fDate
    16-19 Aug. 2004
  • Firstpage
    728
  • Lastpage
    729
  • Abstract
    We present MUSCLE, a new program for creating multiple alignments of protein sequences. MUSCLE achieves the highest scores so far reported on four alignment benchmarks: Balibase, PREFAB, SABmark and SMART, achieving accuracy from 1% to 2.5% higher than T-Coffee and execution times that are generally lower than CLUSTALW for typical input data. With options designed for high-throughput applications, MUSCLE gives average accuracy statistically indistinguishable from T-Coffee and is the fastest published method for large numbers of sequences, able to align 5,000 sequences of length 300 in 7 minutes on a desktop computer. MUSCLE is freely available at http://www.drive5.com/muscle.
  • Keywords
    biology computing; molecular biophysics; proteins; Balibase; CLUSTALW; MUSCLE program; PREFAB; SABmark; SMART; T-Coffee; multiple sequence alignment; protein sequences; Application software; Binary trees; Clustering algorithms; Computational complexity; Frequency; Muscles; Phylogeny; Plants (biology); Proteins; Sequences;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
  • Print_ISBN
    0-7695-2194-0
  • Type

    conf

  • DOI
    10.1109/CSB.2004.1332560
  • Filename
    1332560