Title of article :
Automatic Discovery of Sub-molecular Sequence Domains in Multi-aligned Sequences: A Dynamic Programming Algorithm for Multiple Alignment Segmentation
Author/Authors :
XING، نويسنده , , ERIC POE and WOLF، نويسنده , , DENISE M. and DUBCHAK، نويسنده , , INNA and SPENGLER، نويسنده , , SYLVIA and ZORN، نويسنده , , MANFRED and MUCHNIK، نويسنده , , ILYA and KULIKOWSKI، نويسنده , , CASIMIR، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2001
Pages :
11
From page :
129
To page :
139
Abstract :
Automatic identification of sub-structures in multi-aligned sequences is of great importance for effective and objective structural/functional domain annotation, phylogenetic treeing and other molecular analyses. We present a segmentation algorithm that optimally partitions a given multi-alignment into a set of potentially biologically significant blocks, or segments. This algorithm applies dynamic programming and progressive optimization to the statistical profile of a multi-alignment in order to optimally demarcate relatively homogenous sub-regions. Using this algorithm, a large multi-alignment of eukaryotic 16S rRNA was analyzed. Three types of sequence patterns were identified automatically and efficiently: shared conserved domain; shared variable motif; and rare signature sequence. Results were consistent with the patterns identified through independent phylogenetic and structural approaches. This algorithm facilitates the automation of sequence-based molecular structural and evolutionary analyses through statistical modeling and high performance computation.
Journal title :
Journal of Theoretical Biology
Serial Year :
2001
Journal title :
Journal of Theoretical Biology
Record number :
1534916
Link To Document :
بازگشت