DocumentCode
534908
Title
Identifying nuclear protein subcellular localization using feature dimension reduction method
Author
Wang, Tong ; Huang, Qinghua ; Hu, Lihua
Author_Institution
Inst. of Comput. & Inf., Shanghai Second Polytech. Univ., Shanghai, China
Volume
1
fYear
2010
fDate
13-14 Sept. 2010
Firstpage
329
Lastpage
332
Abstract
The subcellular location of a protein is closely correlated to its function. Facing the deluge of protein sequences generated in the post-genomic age, it is necessary to develop useful machine learning tools to identify the protein subcellular localization. DR (Dimensional Reduction) method is one of most famous machine learning tools. Some researchers have begun to explore DR method for computer vision problems such as face recognition, few such attempts have been made for classification of high-dimensional protein data sets. In this paper, DR method is employed to reduce the size of the features space. Comparison between linear DR methods (PCA and LDA) and nonlinear DR methods (KPCA and KLDA) is performed to predict subcellular localization of nuclear proteins. Experimental results thus obtained are quite encouraging, which indicate that the DR method is used effectively to deal with this complicated problem of viral proteins subcellular localization prediction. The overall jackknife success rate with KLDA is the highest relative to the other DR methods.
Keywords
bioinformatics; cellular biophysics; data reduction; learning (artificial intelligence); principal component analysis; proteins; KLDA; KPCA; feature dimension reduction method; feature space size reduction; linear DR method comparison; machine learning tools; nonlinear DR method; nuclear protein subcellular localization; protein function; protein sequences; viral protein subcellular localization prediction; Bioinformatics; Genomics; Proteins; Feature dimension reduction; PSSM(Position-Specific Score Matrix); Subcellular localization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computational Intelligence and Natural Computing Proceedings (CINC), 2010 Second International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-7705-0
Type
conf
DOI
10.1109/CINC.2010.5643828
Filename
5643828
Link To Document