Title :
Prediction of the Number of Helices for the Twilight Zone Proteins
Author :
Kedarisetti, KanakaDurga ; Chen, Ke ; Kapoor, Aashima ; Kurgan, Lukasz
Author_Institution :
Dept. of Electr. & Comput. Eng., Alberta Univ., Edmonton, Alta.
Abstract :
Protein structure prediction is one of the core research areas in bioinformatics. This paper addresses the protein secondary structure prediction problem for the twilight zone proteins, which are characterized by low, about 25% homology to the sets of known sequences. The commonly used sequence alignment based algorithms fail to provide accurate prediction for sequences of such low homology, and thus alternative solutions should be sought. We propose a novel method that aims at the prediction of the number of helical structures based on the twilight zone protein sequences. The method is based on a custom designed and compact feature based sequences representation and applies a decision tree prediction algorithm. The performed experimental study shows superiority of the proposed method over three other prediction algorithms and the results provided by YASPIN algorithm, which is a state-of-the-art alignment based secondary structure prediction method designed using low homology sequences
Keywords :
biology computing; decision trees; proteins; sequences; YASPIN algorithm; bioinformatics; decision tree prediction algorithm; helices; homology sequences; protein secondary structure prediction; protein sequence; sequence alignment based algorithm; twilight zone proteins; Algorithm design and analysis; Assembly; Bioinformatics; Decision trees; Design methodology; Libraries; Prediction algorithms; Prediction methods; Protein engineering; Protein sequence; Helix Prediction; Protein Secondary Structure; Protein Sequence; Twilight Zone;
Conference_Titel :
Computational Intelligence and Bioinformatics and Computational Biology, 2006. CIBCB '06. 2006 IEEE Symposium on
Conference_Location :
Toronto, Ont.
Print_ISBN :
1-4244-0623-4
Electronic_ISBN :
1-4244-0624-2
DOI :
10.1109/CIBCB.2006.330972