Title :
A novel feature fusion method for predicting protein subcellular localization with multiple sites
Author :
Dong Wang;Shiyuan Han;Xumi Qu;Wenzheng Bao;Yuehui Chen;Yuling Fan;Jin Zhou
Author_Institution :
School of Information science and Engineering, University of Jinan, Jinan, P. R. China
Abstract :
This paper proposes a novel feature fusion method for the protein subcellular multiple-site localization prediction. Several types of features are employed in this novel protein coding method. The first one is the composition of amino acids. The second is pseudo amino acid composition, which mainly extract the location information of each amino acid residues in protein sequence. Lastly, the information for local sequence of amino acids is taken into consideration in this research. Generally, k nearest neighbor, supporting vector machine and other methods, has been used in the field of protein subcellular localization prediction. In our research, the multi-label k nearest neighbor algorithm has been employed in the classification model. The overall accuracy rate may reach 66.7304% in Gnos-mploc dataset.
Keywords :
"Amino acids","Feature extraction","Protein sequence","Prediction algorithms","Algorithm design and analysis","Classification algorithms"
Conference_Titel :
Informative and Cybernetics for Computational Social Systems (ICCSS), 2015 International Conference on
DOI :
10.1109/ICCSS.2015.7281141