DocumentCode
3159894
Title
A relative position based algorithm to find out the longest common subsequence from multiple biological sequences
Author
Shukla, Amit ; Agarwal, Suneeta
Author_Institution
Comput. Sci. & Eng. Dept., MNNIT Allahabad, Allahabad, India
fYear
2010
fDate
17-19 Sept. 2010
Firstpage
496
Lastpage
502
Abstract
The LCS problem is to find a subsequence that is common to two or more given sequences and is the longest one of such subsequences. This problem of searching the longest common subsequence (LCS) from multiple biological sequences is one of the most fundamental tasks in bioinformatics. In this paper, we present an easy and time efficient parallel algorithm based on calculating the relative positions of characters for finding out the LCS from any number of given DNA, RNA, Protein or general sequences. Here we give the example of DNA sequences, although this can be used for RNA, Protein or general sequences (provided we know the number of characters being used). The speed up in our LCS Algorithm is achieved through the pruning operations, in which we recognize and reject all those nucleotides which cannot generate the next character of the LCS thus reducing the search space and accelerating the search speed.
Keywords
bioinformatics; parallel algorithms; DNA; RNA; longest common subsequence; multiple biological sequences; parallel algorithm; protein; pruning operations; relative position based algorithm; Algorithm design and analysis; Communications technology; Complexity theory; Computers; DNA; Heuristic algorithms; Indexes; bioinformatics; longest common subsequence; parallel algorithm; pruning;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Communication Technology (ICCCT), 2010 International Conference on
Conference_Location
Allahabad, Uttar Pradesh
Print_ISBN
978-1-4244-9033-2
Type
conf
DOI
10.1109/ICCCT.2010.5640478
Filename
5640478
Link To Document