• DocumentCode
    3107498
  • Title

    Prediction of O-Glycosylation Sites in Protein Sequence by Kernel Principal Component Analysis

  • Author

    Yang, Xue-Mei ; Cui, Xue-Wei ; Yang, Xue-Zhu

  • Author_Institution
    Coll. of Math. & Inf. Sci., Xianyang Normal Univ., Xianyang, China
  • fYear
    2010
  • fDate
    26-28 Sept. 2010
  • Firstpage
    267
  • Lastpage
    270
  • Abstract
    O-glycosylation is one of the main types of the mammalian protein glycosylation, it occurs on the particular site of serine and threonine. It´s important to predict the O-glycosylation site. In this paper, we propose a new method of kernel principal component analysis (KPCA) to predict the O-glycosylation site with window size w=9. The samples for experiment are encoded by the sparse coding and projected into kernel space first, then the features are extracted by PCA, at last the classification is done by Mahanalobis distance. The result of experiments shows that the proposed method of KPCA is more effective and accurate than PCA. The prediction accuracy is about 84.5%.
  • Keywords
    bioinformatics; feature extraction; pattern classification; principal component analysis; O-glycosylation site prediction; features extraction; kernel principal component analysis; mammalian protein glycosylation; protein sequence; sparse coding; Accuracy; Encoding; Kernel; Principal component analysis; Protein sequence; Training; KPCA; classification; glycosylation; prediction; protein; sparse coding;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Aspects of Social Networks (CASoN), 2010 International Conference on
  • Conference_Location
    Taiyuan
  • Print_ISBN
    978-1-4244-8785-1
  • Type

    conf

  • DOI
    10.1109/CASoN.2010.68
  • Filename
    5636903