DocumentCode
893440
Title
A Grid-Enabled Protein Secondary Structure Predictor
Author
Mirto, Maria ; Cafaro, Massimo ; Fiore, Sandro Luigi ; Tartarini, Daniele ; Aloisio, Giovanni
Author_Institution
Center for Adv. Computational Technol., Nat. Nanotechnology Lab.
Volume
6
Issue
2
fYear
2007
fDate
6/1/2007 12:00:00 AM
Firstpage
124
Lastpage
130
Abstract
We present an integrated Grid system for the prediction of protein secondary structures, based on the frequent automatic update of proteins in the training set. The predictor model is based on a feed-forward multilayer perceptron (MLP) neural network which is trained with the back-propagation algorithm; the design reuses existing legacy software and exploits novel grid components. The predictor takes into account the evolutionary information found in multiple sequence alignment (MSA); the information is obtained running an optimized parallel version of the PSI-BLAST tool, based on the MPI Master-Worker paradigm. The training set contains proteins of known structure. Using Grid technologies and efficient mechanisms for running the tools and extracting the data, the time needed to train the neural network is dramatically reduced, whereas the results are comparable to a set of well-known predictor tools.
Keywords
backpropagation; biology computing; grid computing; molecular biophysics; molecular configurations; multilayer perceptrons; proteins; MPI Master-Worker paradigm; PSI-BLAST tool; back-propagation algorithm; feedforward multilayer perceptron; grid; multiple sequence alignment; neural network; protein secondary structure predictor; Algorithm design and analysis; Data mining; Feedforward neural networks; Feedforward systems; Multi-layer neural network; Multilayer perceptrons; Neural networks; Predictive models; Proteins; Software algorithms; Grid computing; Web services; neural networks; protein structure prediction; Algorithms; Artificial Intelligence; Computer Simulation; Internet; Models, Chemical; Models, Molecular; Protein Structure, Secondary; Proteins; Sequence Analysis, Protein; Software; User-Computer Interface;
fLanguage
English
Journal_Title
NanoBioscience, IEEE Transactions on
Publisher
ieee
ISSN
1536-1241
Type
jour
DOI
10.1109/TNB.2007.897475
Filename
4220634
Link To Document