Title :
Principal Component Analysis for Prediction of O-Linked Glycosylation Sites in Protein by Multi-Layered Neural Networks
Author :
Wang, Chu-Zheng ; Han, Xian-Hua ; Ito, Masahiro ; Nishikawa, Ikuko ; Chen, Yen-wei
Author_Institution :
Coll. of Comput. Sci., Central South Univ. of Forestry & Technol., Changsha, China
Abstract :
Glycosylation is one of the common post-translation modification of protein in eukaryotic cells. Conventional neural network methods have been applied to predict glycosylation sites in protein sequence and the prediction accuracy is dependent on the dimension of feature vector (length of protein sequence). Though the prediction accuracy can be improved by increasing the length of protein sequence, it is time-consuming. In this paper, we propose a novel approach which combines PCA with a multilayer neural network for efficient, and accurate prediction of O-glycosylation sites in protein. PCA is first used to extract main basis (subspace) of the protein sequence. The lower-dimensional projections on the subspace are used as features instead of the higher-dimensional protein sequences. Compared with conventional method (without PCA), our proposed method can significantly reduce the larger computation cost, while can keep the higher prediction accuracy.
Keywords :
biology computing; neural nets; principal component analysis; proteins; O-linked glycosylation sites prediction; eukaryotic cells; multi-layered neural networks; prediction accuracy; principal component analysis; protein post-translation modification; protein sequence; Accuracy; Amino acids; Computational efficiency; Computer networks; Educational institutions; Intelligent networks; Multi-layer neural network; Neural networks; Principal component analysis; Protein sequence; O-glycosylation; multi-layered neural networks; pattern analysis; principal component analysis;
Conference_Titel :
Intelligent Information Hiding and Multimedia Signal Processing, 2009. IIH-MSP '09. Fifth International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4244-4717-6
Electronic_ISBN :
978-0-7695-3762-7
DOI :
10.1109/IIH-MSP.2009.182