Title :
SCOP family fingerprints: An information theoretic approach to structural classification of protein domains
Author :
Casagrande, Alberto ; Fabris, Francesco
Author_Institution :
Dipt. di Mat. e Inf., Univ. di Trieste, Trieste, Italy
Abstract :
Protein domain classification is a useful instrument for deducing functional properties of proteins. Several databases have been introduced that collect domains of known structure, and SCOP is probably the most widely used. It classifies domains in a four-level hierarchy and groups sequences according to both structural similarity and phylogenetic relation. Many automatic tools have been proposed to classify domains according to the available databases. In this paper we introduce the notion of “fingerprint” as a simple, readable digest of the similarities between a sequence and an entire set of sequences. This concept provides a rationale for building an automatic SCOP classifier that assigns a query sequence to the most likely family. Fingerprint-based analysis has been implemented in a software tool, and we report some experimental validations of it.
Keywords :
bioinformatics; genetics; molecular biophysics; molecular configurations; proteins; SCOP family fingerprints; domain classification; four level hierarchy; information theoretic approach; phylogenetic relation; protein domains; software tool; structural classification; structural similarity; Amino acids; Biological system modeling; Convergence; Databases; Protein engineering; Proteins; BLOSUM Spectrum; Domain Family Characterization; Protein Domains; SCOP Classification;
Conference_Title :
Bioinformatics and Biomedicine Workshops (BIBMW), 2011 IEEE International Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4577-1612-6
DOI :
10.1109/BIBMW.2011.6112408