DocumentCode :
534908
Title :
Identifying nuclear protein subcellular localization using feature dimension reduction method
Author :
Wang, Tong ; Huang, Qinghua ; Hu, Lihua
Author_Institution :
Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Volume :
1
fYear :
2010
fDate :
13-14 Sept. 2010
Firstpage :
329
Lastpage :
332
Abstract :
The subcellular location of a protein is closely correlated to its function. Facing the deluge of protein sequences generated in the post-genomic age, it is necessary to develop useful machine learning tools to identify the protein subcellular localization. DR (Dimensional Reduction) method is one of most famous machine learning tools. Some researchers have begun to explore DR method for computer vision problems such as face recognition, few such attempts have been made for classification of high-dimensional protein data sets. In this paper, DR method is employed to reduce the size of the features space. Comparison between linear DR methods (PCA and LDA) and nonlinear DR methods (KPCA and KLDA) is performed to predict subcellular localization of nuclear proteins. Experimental results thus obtained are quite encouraging, which indicate that the DR method is used effectively to deal with this complicated problem of viral proteins subcellular localization prediction. The overall jackknife success rate with KLDA is the highest relative to the other DR methods.
Keywords :
bioinformatics; cellular biophysics; data reduction; learning (artificial intelligence); principal component analysis; proteins; KLDA; KPCA; feature dimension reduction method; feature space size reduction; linear DR method comparison; machine learning tools; nonlinear DR method; nuclear protein subcellular localization; protein function; protein sequences; viral protein subcellular localization prediction; Bioinformatics; Genomics; Proteins; Feature dimension reduction; PSSM(Position-Specific Score Matrix); Subcellular localization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Natural Computing Proceedings (CINC), 2010 Second International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-7705-0
Type :
conf
DOI :
10.1109/CINC.2010.5643828
Filename :
5643828
Link To Document :
بازگشت