DocumentCode
165992
Title
Progressive alignment using Shortest Common Supersequence
Author
Garg, Adesh ; Garg, Deepak
Author_Institution
Dept. of Comput. Sci. & Eng., Thapar Univ., Patiala, India
fYear
2014
fDate
24-27 Sept. 2014
Firstpage
1113
Lastpage
1117
Abstract
Multiple Sequence Alignment is an NP-hard problem. The complexity of finding the optimal alignment is O(LN) where L is the length of the longest sequence and N is the number of sequences. Hence the optimal solution is nearly impossible for most of the datasets. Progressive alignment solves MSA in very economic complexity but does not provide accurate solutions because there is a tradeoff between accuracy and complexity. Guide tree that guides the alignment of sequences is generated by alignment score in progressive alignment. In this paper, Shortest Common Supersequence (SCS) is utilized to generate the guide tree for progressive alignment and the output alignment results are checked by BAliBASE benchmarks for accuracy. According to SP and TC scores, progressive alignment using the guide tree generated by SCS is better than the guide tree generated by alignment score. Original ClustalW2.1 is modified by SCS, and modified ClustalW2.1 gives better results than the original tool.
Keywords
bioinformatics; computational complexity; genetics; optimisation; trees (mathematics); DNA sequences; NP-hard problem; SCS; guide tree; multiple sequence alignment; progressive alignment; shortest common supersequence; Accuracy; Bioinformatics; Complexity theory; DNA; Hidden Markov models; Proteins; Vegetation; BAliBASE; ClustalW2; Multiple sequence alignment; Progressive alignment; SCS;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location
New Delhi
Print_ISBN
978-1-4799-3078-4
Type
conf
DOI
10.1109/ICACCI.2014.6968310
Filename
6968310
Link To Document