DocumentCode :
79539
Title :
DALIX: Optimal DALI Protein Structure Alignment
Author :
Wohlers, Inken ; Andonov, Rumen ; Klau, Gunnar W.
Author_Institution :
Genominformatik, Univ. Duisburg-Essen/ Universitatsklinikum, Essen, Germany
Volume :
10
Issue :
1
fYear :
2013
fDate :
Jan.-Feb. 2013
Firstpage :
26
Lastpage :
36
Abstract :
We present a mathematical model and exact algorithm for optimally aligning protein structures using the DALI scoring model. This scoring model is based on comparing the interresidue distance matrices of proteins and is used in the popular DALI software tool, a heuristic method for protein structure alignment. Our model and algorithm extend an integer linear programming approach that has been previously applied for the related, but simpler, contact map overlap problem. To this end, we introduce a novel type of constraint that handles negative score values and relax it in a Lagrangian fashion. The new algorithm, which we call DALIX, is applicable to any distance matrix-based scoring scheme. We also review options that allow to consider fewer pairs of interresidue distances explicitly because their large number hinders the optimization process. Using four known data sets of varying structural similarity, we compute many provably score-optimal DALI alignments. This allowed, for the first time, to evaluate the DALI heuristic in sound mathematical terms. The results indicate that DALI usually computes optimal or close to optimal alignments. However, we detect a subset of small proteins for which DALI fails to generate any significant alignment, although such alignments do exist.
Keywords :
biology computing; molecular configurations; proteins; proteomics; DALI scoring model; DALIX algorithm; contact map overlap problem; exact algorithm; integer linear programming approach; interresidue distance matrices; optimal DALI protein structure alignment; structural similarity; Amino acids; Bioinformatics; Computational biology; Dynamic programming; Linear programming; Mathematical model; Proteins; DALI; Lagrangian relaxation; Structure alignment; exact algorithm; integer linear program; interresidue distance matrix; Algorithms; Computational Biology; Databases, Protein; Models, Chemical; Models, Molecular; Proteins; Sequence Alignment; Sequence Analysis, Protein;
fLanguage :
English
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
1545-5963
Type :
jour
DOI :
10.1109/TCBB.2012.143
Filename :
6365174
Link To Document :
بازگشت