Title :
A fast and efficient nearest neighbor method for protein secondary structure prediction
Author :
Yang, Wei ; Wang, Kuanquan ; Zuo, Wangmeng
Author_Institution :
Biocomput. Res. Centre, Harbin Inst. of Technol., Harbin, China
Abstract :
Using PSSM profiles, various machine learning methods have been successfully developed for protein secondary structure prediction. With the steady increase of protein structure data, the probability of having available homologous structural information of the protein in real prediction is now fairly high and will continue to increase. Therefore, how to effectively utilize the ever-growing protein structure data has become a huge challenge and opportunity. In this paper, we propose a novel nearest neighbor method, DPred, to use both homologous and non-homologous information for protein secondary structure prediction. On the dataset composed of new solved proteins, the method achieves the overall Q3 and SOV scores of 87.51% and 86.50%, which is comparable with Porter_H and better than PROTEUS and CDM.
Keywords :
bioinformatics; learning (artificial intelligence); pattern clustering; probability; proteins; DPred; PSSM profiles; Porter_H; homologous structural information; machine learning method; nearest neighbor method; position specific scoring matrices; protein secondary structure prediction; Proteins; Nearest neighbor; bioinformatics; data mining; protein secondary structure prediction;
Conference_Titel :
Advanced Computer Control (ICACC), 2011 3rd International Conference on
Conference_Location :
Harbin
Print_ISBN :
978-1-4244-8809-4
Electronic_ISBN :
978-1-4244-8810-0
DOI :
10.1109/ICACC.2011.6016402