• DocumentCode
    3269040
  • Title

    Prediction of protein secondary structure using large margin nearest neighbor classification

  • Author

    Yang, Wei ; Wang, Kuanquan ; Zuo, Wangmeng

  • Author_Institution
    Biocomput. Res. Centre, Harbin Inst. of Technol., Harbin, China
  • fYear
    2011
  • fDate
    18-20 Jan. 2011
  • Firstpage
    202
  • Lastpage
    205
  • Abstract
    Prediction of protein secondary structure from a primary sequence plays a critical role in structural biology. In this paper, we introduce a novel method for protein secondary structure prediction by using PSSM profiles and large margin nearest neighbor classification. Although the PSSM profiles and traditional nearest neighbor (NN) method can be directly used to predict secondary structure, since the PSSM profiles are not specifically designed for protein secondary structure prediction, the NN method could not achieve satisfactory prediction accuracy. To addressing this problem, we use a large margin nearest neighbor model to learn a Mahalanobis distance metric via convex semidefinite programming for nearest neighbor classification. Then, an energy-based rule is invoked to assign secondary structure. Tests show that, compared with other NN methods, significant performance improvement has been achieved with respect to prediction accuracy by the proposed method.
  • Keywords
    bioinformatics; convex programming; pattern classification; proteins; Mahalanobis distance metric; PSSM profiles; convex semidefinite programming; large margin nearest neighbor classification; protein secondary structure; structural biology; Fasteners; Periodic structures; Predictive models; Nearest neighbor; distance metric; large margin; protein secondary structure prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Computer Control (ICACC), 2011 3rd International Conference on
  • Conference_Location
    Harbin
  • Print_ISBN
    978-1-4244-8809-4
  • Electronic_ISBN
    978-1-4244-8810-0
  • Type

    conf

  • DOI
    10.1109/ICACC.2011.6016397
  • Filename
    6016397