DocumentCode :
1798314
Title :
DL-PRO: A novel deep learning method for protein model quality assessment
Author :
Nguyen, Son P. ; Yi Shang ; Dong Xu
Author_Institution :
Dept. of Comput. Sci., Univ. of Missouri, Columbia, MO, USA
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
2071
Lastpage :
2078
Abstract :
Computational protein structure prediction is very important for many applications in bioinformatics. In the process of predicting protein structures, it is essential to accurately assess the quality of generated models. Although many single-model quality assessment (QA) methods have been developed, their accuracy is not high enough for most real applications. In this paper, a new approach based on C-a atoms distance matrix and machine learning methods is proposed for single-model QA and the identification of native-like models. Different from existing energy/scoring functions and consensus approaches, this new approach is purely geometry based. Furthermore, a novel algorithm based on deep learning techniques, called DL-Pro, is proposed. For a protein model, DL-Pro uses its distance matrix that contains pairwise distances between two residues´ C-a atoms in the model, which sometimes is also called contact map, as an orientation-independent representation. From training examples of distance matrices corresponding to good and bad models, DL-Pro learns a stacked autoencoder network as a classifier. In experiments on selected targets from the Critical Assessment of Structure Prediction (CASP) competition, DL-Pro obtained promising results, outperforming state-of-the-art energy/scoring functions, including OPUS-CA, DOPE, DFIRE, and RW.
Keywords :
bioinformatics; learning (artificial intelligence); pattern classification; proteins; C-a atoms distance matrix; CASP competition; Critical Assessment of Structure Prediction competition; DFIRE; DL-PRO; DOPE; OPUS-CA; RW; bioinformatics; classifier; computational protein structure prediction; consensus approach; contact map; deep learning method; energy-scoring function; machine learning methods; native-like model identification; orientation-independent representation; pairwise distance; protein model quality assessment; single-model QA method; single-model quality assessment method; stacked autoencoder network; Classification algorithms; Computational modeling; Predictive models; Proteins; Solid modeling; Three-dimensional displays; Training; Critical Assessment of Structure Prediction (CASP); classification; deep learning; energy and scoring function; protein model quality assessment; stacked autoencoder;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889891
Filename :
6889891
Link To Document :
بازگشت