DocumentCode
3483566
Title
Optimal approach to sequence-to-sequence prediction: applications in bioinformatics
Author
Nguyen, Minh Ngoc ; Rajapakse, Jagath C.
Author_Institution
Sch. of Comput. Eng., Nat. Technol. Univ., Singapore
Volume
5
fYear
2002
fDate
18-22 Nov. 2002
Firstpage
2254
Abstract
We propose a two-stage approach to sequence-to-sequence prediction problem by using support vector machines (SVMs) to optimize the prediction from single stage techniques. The sequence-to-sequence prediction problem is common to many bioinformatics applications and we demonstrate our approach by using it on the protein secondary structure prediction problem. The new predictor combining different types of GOR (Gamier, Osguthorpe, and Robson) and Bayesian classifiers with SVMs achieves an accuracy of 70.9% when using the sevenfold cross validation on a database of 126 nonhomologous globular proteins. Extending the method to multiple sequence alignments of homologous proteins significantly increases the prediction accuracy to 72.1%. The results show that it is possible to obtain a higher accuracy with combined hierarchical classifiers than single stage classifiers alone, in the sequence prediction.
Keywords
Bayes methods; biology computing; pattern classification; scientific information systems; support vector machines; Bayesian classifiers; GOR classifiers; SVMs; bioinformatics; nonhomologous globular protein database; optimal sequence-to-sequence prediction; protein secondary structure prediction problem; support vector machines; Accuracy; Application software; Bayesian methods; Bioinformatics; Databases; Neural networks; Proteins; Sequences; Statistics; Support vector machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Information Processing, 2002. ICONIP '02. Proceedings of the 9th International Conference on
Print_ISBN
981-04-7524-1
Type
conf
DOI
10.1109/ICONIP.2002.1201894
Filename
1201894
Link To Document