Title :
A method for improving the accuracy of predicting protein localization
Author :
Wang, Tong ; Huang, Qinghua ; Yao, Lixiu
Author_Institution :
Inst. of Comput. & Inf., Polytech. Univ., Shanghai, China
Abstract :
In this paper, a system based on the novel Maximum Variance Projection (MVP) is proposed to improve the performance of protein subcellular localization prediction. Firstly, the protein sequences are quantized into a high dimension space using a new representation approach Position-Specific Score Matrix (PSSM). However, the problems caused by such representation are computation complexity and complicated classifier design. To sort out this problem, a new dimension reduction algorithm, MVP, is introduced. It extracts the essential features from the high dimension feature space. Then, K-Nearest Neighbor (K-NN) classifier is employed to recognize the subcellular localization of proteins according to the new features after dimension reduction. A good experimental result is obtained based on the jackknife dataset.
Keywords :
bioinformatics; cellular biophysics; computational complexity; proteins; K-Nearest Neighbor classifier; Maximum Variance Projection; Position-Specific Score Matrix; accuracy; classifier design; computation complexity; jackknife dataset; protein subcellular localization prediction; Accuracy; Amino acids; Classification algorithms; Feature extraction; Prediction algorithms; Principal component analysis; Proteins; Dimensionality reduction; PSSM; Subcellular localization;
Conference_Titel :
Computer Science and Education (ICCSE), 2010 5th International Conference on
Conference_Location :
Hefei
Print_ISBN :
978-1-4244-6002-1
DOI :
10.1109/ICCSE.2010.5593736