Title :
A New Progressive Multiple Sequence Alignment Algorithm
Author :
Hosni, Soumaya ; Mokaddem, Ahmed ; Elloumi, Mourad
Author_Institution :
Lab. of Technol. of Inf. & Commun. & Electr. Eng. (LaTICE), Tunis Univ. of Tunis, Tunis, Tunisia
Abstract :
In this paper, we present a new progressive algorithm for Multiple Sequence Alignment (MSA). Our algorithm adopts a new distance, called anchor distance to compute the distance between two protein sequences, and a variant of the UPGMA method to construct a guide tree. Our algorithm implements also a refinement step. Our algorithm is of complexity O(N4+N*L2) in computing time, where N is the number of the sequences and L is the length of the longest sequence. We asses our algorithm on different protein benchmarks, e.g., BALIBASE, HOMSTRAD, OXBENCH, and we compared the obtained results to those obtained with other alignment algorithms, e.g., CLUSTALW, MUSCLE, MAFFT and PROBCONS, using the Column Score (CS) and the Sum of Pairs Score (SPS). We obtained good results.
Keywords :
biology computing; computational complexity; proteins; BALIBASE; CLUSTALW; CS; HOMSTRAD; MAFFT; MSA; MUSCLE; O(N4+N*L2) complexity; OXBENCH; PROBCONS; SPS; UPGMA method; alignment algorithms; anchor distance; column score; guide tree; progressive multiple sequence alignment algorithm; protein benchmarks; protein sequences; sum of pairs score; Benchmark testing; Bioinformatics; Databases; Muscles; Proteins; Multiple sequence alignment; algorithms; distances; progressive alignment;
Conference_Titel :
Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on
Conference_Location :
Vienna
Print_ISBN :
978-1-4673-2621-6
DOI :
10.1109/DEXA.2012.8