مرکز منطقه ای اطلاع رساني علوم و فناوري - Finding protein domain boundaries: an automated, non-homology-based method

DocumentCode :

755877

Title :

Finding protein domain boundaries: an automated, non-homology-based method

Author :

Gurbaxani, Brian M. ; Mallick, Parag

Volume :

Issue :

fYear :

2005

Firstpage :

Lastpage :

Abstract :

A sequence-based methodology identifies the boundaries of structural domains in proteins. The method doesn\´t depend on knowledge of a protein\´s structure or on sequence homologs. We developed a Bayesian approach based on the statistical analysis of word content used in other fields. Our method first catalogs "pattern" frequencies - occurrences of groups of amino acids - in a nonredundant database of known protein domains and then uses the distributions of these patterns to identify regions of protein sequence that appear to signal the beginnings and ends of domains. The domain-delineating signals we\´ve produced using amino acid patterns show great promise in providing further insight into both the biochemistry and structural biology of proteins.

Keywords :

Bayes methods; biology computing; genetics; pattern recognition; proteins; statistical analysis; Bayesian approach; amino acids; automated nonhomology-based method; nonredundant database; pattern frequencies; protein domain boundaries; protein sequence; statistical analysis; Amino acids; Automatic generation control; Bioinformatics; Databases; Dictionaries; Diseases; Genomics; Humans; Protein sequence; Splicing; Bayesian algorithm; amino acid patterns; protein domains;

fLanguage :

English

Journal_Title :

Intelligent Systems, IEEE

Publisher :

ieee

ISSN :

1541-1672

Type :

jour

DOI :

10.1109/MIS.2005.106

Filename :

1556512

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=755877