• DocumentCode
    952339
  • Title

    Evaluating Protein Similarity from Coarse Structures

  • Author

    Wang, Yong ; Ling-yun Wu ; Zhang, Ji-Hong ; Zhan, Zhong-Wei ; Xiang-Sun Zhang ; Luonan Chen

  • Author_Institution
    Inst. of Appl. Math., Chinese Acad. of Sci., Beijing, China
  • Volume
    6
  • Issue
    4
  • fYear
    2009
  • Firstpage
    583
  • Lastpage
    593
  • Abstract
    To unscramble the relationship between protein function and protein structure, it is essential to assess the protein similarity from different aspects. Although many methods have been proposed for protein structure alignment or comparison, alternative similarity measures are still strongly demanded due to the requirement of fast screening and query in large-scale structure databases. In this paper, we first formulate a novel representation of a protein structure, i.e., feature sequence of surface (FSS). Then, a new score scheme is developed to measure the similarity between two representations. To verify the proposed method, numerical experiments are conducted in four different protein data sets. We also classify SARS coronavirus to verify the effectiveness of the new method. Furthermore, preliminary results of fast classification of the whole CATH v2.5.1 database based on the new macrostructure similarity are given as a pilot study. We demonstrate that the proposed approach to measure the similarities between protein structures is simple to implement, computationally efficient, and surprisingly fast. In addition, the method itself provides a new and quantitative tool to view a protein structure.
  • Keywords
    molecular biophysics; numerical analysis; proteins; CATH v2.5.1 database; SARS coronavirus; feature sequence-of-surface; macrostructure similarity; numerical experiments; protein function; protein similarity; protein structure; Atomic measurements; Biology computing; Data mining; Frequency selective surfaces; Large-scale systems; Mathematics; Organizing; Proteins; Sequences; Spatial databases; Bioinformatics (genome or protein) databases; Machine learning; Optimization; Protein structure; protein surface.; structure comparison; Algorithms; Computational Biology; Computer Simulation; Databases, Protein; Humans; Models, Molecular; Pattern Recognition, Automated; Protein Conformation; Proteins; SARS Virus; Sequence Alignment; Sequence Analysis, Protein; Software;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2007.70250
  • Filename
    4359899