• DocumentCode
    1874699
  • Title

    Prediction of Protein-Protein Interaction Sites Using Constructive Neural Network Ensemble

  • Author

    Zhang, Yan-ping ; Zhang, Li-Na ; Wang, Yong-Cheng

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Anhui Univ., Hefei, China
  • fYear
    2010
  • fDate
    10-12 Dec. 2010
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Abstract-Prediction of protein-proteininteraction sites is very important to the function of a protein and drug design. In this paper, we adequately utilize the characters of ensemble learning, which can improve the accuracy of individual classifier and generalization ability of the system, and propose a new prediction method of protein-protein interaction sites: ensemble learning method based on the constructive neural network. Protein sequence profile and residue accessible area are used as input feature vectors. We evaluate the ensemble classifiers and compare them with several traditional methods (SVM, ANN, CA and Bayesian) on the dataset of 61 protein chains with 5-fold cross validation. The results clearly show that the proposed ensemble method is quite effective in predicting protein binding sites. Our method achieves good performance (Accuracy of 73.26%, Sensitivity of 58.38%, Specificity of 68.87%, CC of 35.47% and F1-measure of 63.04%), which is significantly better than that of the compared methods. The results obtained show that our proposed method is a promising approach for predicting protein-protein interaction sites.The experiments show the validation and correctness of the ensemble method based on Covering Algorithm (CA).
  • Keywords
    biological techniques; drugs; learning (artificial intelligence); molecular biophysics; neural nets; pattern classification; proteins; proteomics; sequences; 5-fold cross validation; constructive neural network ensemble; covering algorithm; drug design; ensemble classifier; ensemble learning; input feature vector; protein design; protein sequence profile; protein-protein interaction site; residue accessible area; Accuracy; Artificial neural networks; Classification algorithms; Prediction algorithms; Proteins; Sensitivity; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering (CiSE), 2010 International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-5391-7
  • Electronic_ISBN
    978-1-4244-5392-4
  • Type

    conf

  • DOI
    10.1109/CISE.2010.5676946
  • Filename
    5676946