• DocumentCode
    20304
  • Title

    Identification of DNA-Binding and Protein-Binding Proteins Using Enhanced Graph Wavelet Features

  • Author

    Yuan Zhu ; Weiqiang Zhou ; Dao-Qing Dai ; Hong Yan

  • Author_Institution
    Dept. of Math., Guangdong Univ. of Finance & Econ., Guangzhou, China
  • Volume
    10
  • Issue
    4
  • fYear
    2013
  • fDate
    July-Aug. 2013
  • Firstpage
    1017
  • Lastpage
    1031
  • Abstract
    Interactions between biomolecules play an essential role in various biological processes. For predicting DNA-binding or protein-binding proteins, many machine-learning-based techniques have used various types of features to represent the interface of the complexes, but they only deal with the properties of a single atom in the interface and do not take into account the information of neighborhood atoms directly. This paper proposes a new feature representation method for biomolecular interfaces based on the theory of graph wavelet. The enhanced graph wavelet features (EGWF) provides an effective way to characterize interface feature through adding physicochemical features and exploiting a graph wavelet formulation. Particularly, graph wavelet condenses the information around the center atom, and thus enhances the discrimination of features of biomolecule binding proteins in the feature space. Experiment results show that EGWF performs effectively for predicting DNA-binding and protein-binding proteins in terms of Matthew´s correlation coefficient (MCC) score and the area value under the receiver operating characteristic curve (AUC).
  • Keywords
    DNA; biological techniques; biology computing; graph theory; molecular biophysics; proteins; DNA-binding proteins; EGWF; area under ROC curve; biomolecular interfaces; biomolecule binding proteins; biomolecule interactions; complex interface; enhanced graph wavelet features; feature representation method; graph wavelet formulation; graph wavelet theory; interface feature; machine learning based techniques; physicochemical features; protein feature discrimination; protein-binding proteins; receiver operating characteristic; Atomic measurements; Bioinformatics; Computational biology; Correlation; Educational institutions; Feature extraction; Proteins; Protein-protein interaction; alpha shape model; graph wavelet; protein-DNA interaction;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.117
  • Filename
    6606795