Title :
Exploiting Intrastructure Information for Secondary Structure Prediction with Multifaceted Pipelines
Author :
Armano, Giuliano ; Ledda, Filippo
Author_Institution :
Dept. of Electr. & Electron. Eng., Univ. of Cagliari, Cagliari, Italy
Abstract :
Predicting the secondary structure of proteins is still a typical step in several bioinformatic tasks, in particular, for tertiary structure prediction. Notwithstanding the impressive results obtained so far, mostly due to the advent of sequence encoding schemes based on multiple alignment, in our view the problem should be studied from a novel perspective, in which understanding how available information sources are dealt with plays a central role. After revisiting a well-known secondary structure predictor viewed from this perspective (with the goal of identifying which sources of information have been considered and which have not), we propose a generic software architecture designed to account for all relevant information sources. To demonstrate the validity of the approach, a predictor compliant with the proposed generic architecture has been implemented and compared with several state-of-the-art secondary structure predictors. Experiments have been carried out on standard data sets, and the corresponding results confirm the validity of the approach. The predictor is available at http://iasc.diee.unica.it/ssp2/ through the corresponding web application or as downloadable stand-alone portable unpack-and-run bundle.
Keywords :
Internet; bioinformatics; information systems; molecular biophysics; proteins; bioinformatic tasks; downloadable stand-alone portable unpack-run bundle; exploiting intrastructure information; generic architecture; generic software architecture; multifaceted pipelines; proteins; secondary structure prediction; sequence encoding schemes; tertiary structure prediction; web application; Amino acids; Computer architecture; Correlation; Encoding; Pipelines; Prediction algorithms; Proteins; Secondary structure prediction; artificial neural networks.; ensemble architectures; protein encoding schemes; Algorithms; Databases, Protein; Protein Structure, Secondary; Proteins; Sequence Analysis, Protein;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2011.159