DocumentCode :
1435556
Title :
Incremental Mountain Clustering Method to Find Building Blocks for Constructing Structures of Proteins
Author :
Lin, Ken-Li ; Lin, Chin-Teng ; Pal, Nikhil R.
Author_Institution :
Comput. Center, Chung-Hua Univ., Hsinchu, Taiwan
Volume :
9
Issue :
4
fYear :
2010
Firstpage :
278
Lastpage :
288
Abstract :
In this paper, we propose an algorithm named the Incremental Structural Mountain Clustering Method (ISMCM) to find a library of building blocks for reconstructing the 3-D structures of proteins/peptides. The building blocks are short structural motifs identified from an estimate of the local "density" of 3-D fragments, computed using a measure of structural similarity. The structural similarity is computed after the best-molecular-fit alignment of pairs of fragments. The algorithm is tested on two well-known benchmark data sets. Following the protocols used by other researchers, for the first data set we reconstruct a set of 71 test peptides (up to the first 60 residues), whereas for the second data set we reconstruct all 143 test peptides. The ISMCM algorithm is found to reconstruct the test peptides successfully in terms of both global-fit root-mean-square (RMS) error and local-fit RMS error. The low values of the local-fit RMS errors suggest that the building blocks extracted by ISMCM are good quantizers, which can represent nearby fragments quite accurately. To further assess the quality of the building blocks, we use two alternative graphical methods. We also use Shannon's entropy to show the structural similarity of the clusters found by our algorithm. This is important because building blocks that represent clusters of structurally similar fragments will be very effective in reconstruction. The entropic analysis reveals an interesting fact: the secondary structure of the central residue of the fragments in a cluster is the most strongly conserved (minimum entropy) over the cluster, which might indicate that the central residue of the structural motif plays a dominant role in local folding.
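The entropic analysis mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each cluster member is annotated with one secondary-structure label per residue and that conservation is measured as the Shannon entropy of those labels at each fragment position. The abstract does not give the label alphabet, the fragment length, or the exact formulation, so the function name `positional_entropy`, the three-letter alphabet (H, E, C), and the example cluster are all hypothetical.

```python
import math
from collections import Counter


def positional_entropy(fragments):
    """Shannon entropy (in bits) of the labels at each fragment position.

    `fragments` is a list of equal-length strings, one per cluster member.
    The labels are assumed to be secondary-structure codes such as
    'H' (helix), 'E' (strand), 'C' (coil); this alphabet is an assumption.
    """
    if not fragments:
        raise ValueError("cluster is empty")
    length = len(fragments[0])
    if any(len(f) != length for f in fragments):
        raise ValueError("all fragments must have the same length")

    entropies = []
    for pos in range(length):
        counts = Counter(f[pos] for f in fragments)
        total = sum(counts.values())
        # H = sum_k p_k * log2(1 / p_k); zero when every label agrees.
        h = sum((c / total) * math.log2(total / c) for c in counts.values())
        entropies.append(h)
    return entropies


# Hypothetical cluster of four 5-residue fragments (made-up labels).
cluster = ["CHHHC", "EHHHC", "CHHHE", "CCHHH"]
profile = positional_entropy(cluster)

# Index of the first position with minimum entropy (most conserved).
most_conserved = min(range(len(profile)), key=profile.__getitem__)
```

In this made-up cluster the lowest entropy falls on the middle position, which mirrors the kind of observation the abstract reports; real clusters would need the actual fragment annotations.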
Keywords :
bioinformatics; entropy; molecular biophysics; molecular configurations; pattern clustering; proteins; ISMCM; Incremental Structural Mountain Clustering Method; Shannon's entropy; best-molecular-fit alignment; global-fit root-mean-square error; local folding; local-fit RMS error; protein building blocks; protein structures; structural motif; structural similarity; Algorithm design and analysis; Clustering methods; Peptides; Proteins; Root mean square; Building blocks; incremental structural mountain clustering; protein structure; structural mountain clustering; Algorithms; Cluster Analysis; Entropy; Models, Molecular; Peptides; Protein Interaction Domains and Motifs;
fLanguage :
English
Journal_Title :
NanoBioscience, IEEE Transactions on
Publisher :
IEEE
ISSN :
1536-1241
Type :
jour
DOI :
10.1109/TNB.2010.2095467
Filename :
5701731