• DocumentCode
    2541790
  • Title

    Prediction of Protein B-Factor Profile Based on Feature Selection and Kernel Learning

  • Author

    Pan, Xiao-Yong ; Shen, Hong-Bin

  • Author_Institution
    Inst. of Image Process. & Pattern Recognition, Shanghai Jiaotong Univ., Shanghai, China
  • fYear
    2009
  • fDate
    4-6 Nov. 2009
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    B-factor reflects the atom´s uncertainty about its average position within a crystal structure and is highly correlated with protein functions. In this article, we propose a novel approach to predict the real value of B-factor. We firstly extract features from the protein sequences and their evolution information, then apply random forest tree to select the important features, which are further inputted to a two-stage support vector regression (SVR) for prediction. Our results have revealed that a systematic analysis of the importance of different features makes us have deep insights into the different contributions of features and is very necessary for developing effective B-factor prediction tools. We thus develop an online Web server, which is freely available at http://www.csbio.situ.edu.cn/bioinf/PredBF for academic use.
  • Keywords
    bioinformatics; feature extraction; learning (artificial intelligence); molecular biophysics; proteins; regression analysis; support vector machines; trees (mathematics); SVR; atom uncertainty; crystal structure; feature extraction; feature selection; kernel learning; online Web server; protein B-factor profile function prediction tool; protein sequence evolution information; random forest tree; systematic analysis; two-stage support vector regression; Amino acids; Data mining; Feature extraction; Image processing; Kernel; Pattern recognition; Proteins; Regression tree analysis; Uncertainty; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2009. CCPR 2009. Chinese Conference on
  • Conference_Location
    Nanjing
  • Print_ISBN
    978-1-4244-4199-0
  • Type

    conf

  • DOI
    10.1109/CCPR.2009.5344037
  • Filename
    5344037