Title :
Pairwise DNA Alignment with Sequence Specific Transition-Transversion Ratio Using Multiple Parameter Sets
Author :
Agrawal, Ankit ; Huang, Xiaoqiu
Author_Institution :
Dept. of Comput. Sci., Iowa State Univ., Ames, IA, USA
Abstract :
Pairwise DNA and protein sequence alignment is an underlying task in bioinformatics which forms the basis of many other bioinformatics applications. Protein sequence alignment is in general given more importance than DNA sequence alignment, and protein sequence alignment methods can usually be used with little modification for DNA sequences as well. However, alignment methods specific to DNA sequence alignment using sequence specific information are highly desirable. Most existing DNA alignment programs routinely use the common match/mismatch scoring scheme. Recently, an iterative alignment scheme using sequence-specific transition-transversion ratio was shown to be better than using a simple match/mismatch scoring scheme. In this paper, we present a modification to the iterative approach by incorporating in it the use of multiple parameter sets. Preliminary experiments indicate that using multiple parameter sets gives significantly better performance than using a single parameter set, and than using a simple match/mismatch scoring scheme. Sequence specific scoring matrices have been shown to be highly successful for protein alignment over the last decade, and the current work should be a significant step in the direction of using sequence specific substitution matrices for DNA sequences.
Keywords :
DNA; bioinformatics; iterative methods; matrix algebra; proteins; sequences; DNA alignment programs; DNA sequence alignment; bioinformatics; iterative alignment scheme; multiple parameter sets; pairwise DNA alignment; protein sequence alignment; sequence specific information; sequence specific scoring matrices; sequence specific substitution matrices; sequence specific transition-transversion ratio; Application software; Bioinformatics; Computer science; DNA; Databases; Dynamic programming; Information technology; Iterative methods; Matrices; Protein sequence; Pairwise statistical significance; Sequence alignment; Substitution Matrix; Transition-Transversion ratio;
Conference_Titel :
Information Technology, 2008. ICIT '08. International Conference on
Conference_Location :
Bhubaneswar
Print_ISBN :
978-1-4244-3745-0
DOI :
10.1109/ICIT.2008.62