Applications of Bioinformatics Databases to Predict the Secondary Structure

Author

Mugilan, S. Arul

Author_Institution

Dept. of Biotechnol., Kalasalingam Univ., Krishnankoil, India

fYear

2008

fDate

20-22 Dec. 2008

Firstpage

847

Lastpage

851

Abstract

A computing engine, innovative structure prediction parameters (ISPP) has been developed using Python programming. The proposed computing engine has several utilities to enable structural biologists to predict the secondary structural elements from amino acid sequence using Bioinformatics Databases (25% threshold of 3693 non-homologous proteins). Information about the secondary structure of a protein sequence can greatly assist biologists in the generation and testing of hypotheses, as well as design of experiments. The sequential information of proteins has been increasing many folds than their three-dimensional counterpart. In this paper, structure prediction parameters using chi square value with respect to a-helix, B-strand and random structures were generated for the 20 amino acid singlets, 400 amino acid doublets and 8000 amino acid triplets. An innovative method, ISPBD (Innovative Structure Prediction using Bioinformatics Databases), was developed to predict the secondary structure of the proteins from amino acid sequences using the generated structure prediction parameters. The result clearly indicates that the average value of the percentage of prediction accuracy for a-helix by ISPBD, SSPDP, NNPREDICT, DSC, NNSSP and PHD methods was found to be 61%, 57%, 44%, 55%, 59% and 67%. The average percentage value of prediction for B-strand is 70%, 69%, 21% 34%, 56% and 53% respectively by ISPBD, SSPDP, NNPREDICT, DSC, NNSSP and PHD methods. This clearly indicates that for helical prediction my method (ISPBD) has comparable prediction accuracy as that of SSPDP,PHD method but much better prediction than DSC, NNSSP and NNPREDICT. The method (ISPBD) can be used as a candidature for secondary structure prediction from amino acid sequence.

Keywords

bioinformatics; deductive databases; proteins; Python programming; amino acid sequences; bioinformatics databases; computing engine; helical prediction; innovative structure prediction; protein secondary structure prediction; protein sequence; Accuracy; Amino acids; Application software; Bioinformatics; Biology computing; Biotechnology; Data engineering; Databases; Engines; Protein engineering; Bioinformatics database analysis; Chi square value; Secondary Structure Prediction;

fLanguage

English

Publisher

ieee

Conference_Titel

Advanced Computer Theory and Engineering, 2008. ICACTE '08. International Conference on

Conference_Location

Phuket

Print_ISBN

978-0-7695-3489-3

Type

conf

DOI

10.1109/ICACTE.2008.63

Filename

4737077