DocumentCode
3107498
Title
Prediction of O-Glycosylation Sites in Protein Sequence by Kernel Principal Component Analysis
Author
Yang, Xue-Mei ; Cui, Xue-Wei ; Yang, Xue-Zhu
Author_Institution
Coll. of Math. & Inf. Sci., Xianyang Normal Univ., Xianyang, China
fYear
2010
fDate
26-28 Sept. 2010
Firstpage
267
Lastpage
270
Abstract
O-glycosylation is one of the main types of the mammalian protein glycosylation, it occurs on the particular site of serine and threonine. It´s important to predict the O-glycosylation site. In this paper, we propose a new method of kernel principal component analysis (KPCA) to predict the O-glycosylation site with window size w=9. The samples for experiment are encoded by the sparse coding and projected into kernel space first, then the features are extracted by PCA, at last the classification is done by Mahanalobis distance. The result of experiments shows that the proposed method of KPCA is more effective and accurate than PCA. The prediction accuracy is about 84.5%.
Keywords
bioinformatics; feature extraction; pattern classification; principal component analysis; O-glycosylation site prediction; features extraction; kernel principal component analysis; mammalian protein glycosylation; protein sequence; sparse coding; Accuracy; Encoding; Kernel; Principal component analysis; Protein sequence; Training; KPCA; classification; glycosylation; prediction; protein; sparse coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Aspects of Social Networks (CASoN), 2010 International Conference on
Conference_Location
Taiyuan
Print_ISBN
978-1-4244-8785-1
Type
conf
DOI
10.1109/CASoN.2010.68
Filename
5636903
Link To Document