DocumentCode
1988479
Title
Fold recognition using sequence fingerprints of protein local substructures
Author
Kryshtafovych, Andriy ; Hvidsten, Torgeir R. ; Komorowski, Jan ; Fidelis, Krzysztof
Author_Institution
Lawrence Livermore Nat. Lab., Berkeley, CA, USA
fYear
2003
fDate
11-14 Aug. 2003
Firstpage
517
Lastpage
518
Abstract
A protein local substructure (descriptor) is a set of several short, nonoverlapping fragments of the polypeptide chain. Each substructure describes the local environment of a particular residue and includes only those segments of the main chain that are located in the proximity of that residue. Similar descriptors from a representative set of proteins were analyzed to reveal links between the substructures and the sequences of their segments. Using the detected sequence-based fingerprints, specific geometrical conformations are assigned to new sequences. The ability of the approach to recognize correct SCOP folds was tested on 273 sequences from the 49 most popular folds. Good predictions were obtained in 85% of cases. No performance drop was observed with decreasing sequence similarity between the target sequences and the sequences in the training set of proteins.
Keywords
biology computing; pattern recognition; proteins; fold recognition; polypeptide chain; protein local substructure; sequence-based fingerprints; Amino acids; Assembly; Bioinformatics; Fingerprint recognition; Laboratories; Libraries; Prediction methods; Protein engineering; Shape; Testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Bioinformatics Conference, 2003. CSB 2003. Proceedings of the 2003 IEEE
Print_ISBN
0-7695-2000-6
Type
conf
DOI
10.1109/CSB.2003.1227393
Filename
1227393