DocumentCode :
419330
Title :
3D structural homology detection via NMR resonance assignment
Author :
Langmead, Christopher James ; Donald, Bruce Randall
Author_Institution :
Carnegie Mellon Dept. of Comput. Sci., Pittsburgh, PA, USA
fYear :
2004
fDate :
16-19 Aug. 2004
Firstpage :
278
Lastpage :
289
Abstract :
One goal of the structural genomics initiative is the identification of new protein folds. Sequence-based structural homology prediction methods are an important means for prioritizing unknown proteins for structure determination. However, an important challenge remains: two highly dissimilar sequences can have similar folds. How can we detect this rapidly, in the context of structural genomics? High-throughput NMR experiments, coupled with novel algorithms for data analysis, can address this challenge. We report an automated procedure, called HD, for detecting 3D structural homologies from sparse, unassigned protein NMR data. Our method identifies 3D models in a protein structural database whose geometries best fit the unassigned experimental NMR data. HD does not use, and is thus not limited by, sequence homology. The method can also be used to confirm or refute structural predictions made by other techniques such as protein threading or homology modelling. The algorithm runs in $O(pn + pn^{5/2} \log(cn) + p \log p)$ time, where p is the number of proteins in the database, n is the number of residues in the target protein, and c is the maximum edge weight in an integer-weighted bipartite graph. Our experiments on real NMR data from 3 different proteins against a database of 4,500 representative folds demonstrate that the method identifies closely related protein folds, including sub-domains of larger proteins, with as little as 10-30% sequence homology between the target protein (or sub-domain) and the computed model. In particular, we report no false negatives or false positives despite significant percentages of missing experimental data.
Keywords :
biological NMR; biology computing; genetics; molecular biophysics; molecular configurations; physiological models; proteins; 3D structural homology detection; NMR resonance assignment; automated HD procedure; data analysis; homology modelling; integer-weighted bipartite graph; protein folds; protein threading; sequence-based structural homology prediction methods; structural genomics; Bioinformatics; Data analysis; Genomics; Geometry; High definition video; Nuclear magnetic resonance; Prediction methods; Proteins; Solid modeling; Spatial databases;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computational Systems Bioinformatics Conference, 2004. CSB 2004. Proceedings. 2004 IEEE
Print_ISBN :
0-7695-2194-0
Type :
conf
DOI :
10.1109/CSB.2004.1332441
Filename :
1332441