• DocumentCode
    893440
  • Title

    A Grid-Enabled Protein Secondary Structure Predictor

  • Author

    Mirto, Maria ; Cafaro, Massimo ; Fiore, Sandro Luigi ; Tartarini, Daniele ; Aloisio, Giovanni

  • Author_Institution
    Center for Adv. Computational Technol., Nat. Nanotechnology Lab.
  • Volume
    6
  • Issue
    2
  • fYear
    2007
  • fDate
    6/1/2007 12:00:00 AM
  • Firstpage
    124
  • Lastpage
    130
  • Abstract
    We present an integrated Grid system for the prediction of protein secondary structures, based on the frequent automatic update of proteins in the training set. The predictor model is based on a feed-forward multilayer perceptron (MLP) neural network which is trained with the back-propagation algorithm; the design reuses existing legacy software and exploits novel grid components. The predictor takes into account the evolutionary information found in multiple sequence alignment (MSA); the information is obtained running an optimized parallel version of the PSI-BLAST tool, based on the MPI Master-Worker paradigm. The training set contains proteins of known structure. Using Grid technologies and efficient mechanisms for running the tools and extracting the data, the time needed to train the neural network is dramatically reduced, whereas the results are comparable to a set of well-known predictor tools.
  • Keywords
    backpropagation; biology computing; grid computing; molecular biophysics; molecular configurations; multilayer perceptrons; proteins; MPI Master-Worker paradigm; PSI-BLAST tool; back-propagation algorithm; feedforward multilayer perceptron; grid; multiple sequence alignment; neural network; protein secondary structure predictor; Algorithm design and analysis; Data mining; Feedforward neural networks; Feedforward systems; Multi-layer neural network; Multilayer perceptrons; Neural networks; Predictive models; Proteins; Software algorithms; Grid computing; Web services; neural networks; protein structure prediction; Algorithms; Artificial Intelligence; Computer Simulation; Internet; Models, Chemical; Models, Molecular; Protein Structure, Secondary; Proteins; Sequence Analysis, Protein; Software; User-Computer Interface;
  • fLanguage
    English
  • Journal_Title
    NanoBioscience, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1536-1241
  • Type

    jour

  • DOI
    10.1109/TNB.2007.897475
  • Filename
    4220634