DocumentCode :
463460
Title :
Protein Fold Recognition using Residue-Based Alignments of Sequence and Secondary Structure
Author :
Aydin, Zafer ; Erdogan, Hakan ; Altunbasak, Yucel
Author_Institution :
Center for Signal & Image Process., Georgia Inst. of Technol., Atlanta, GA
Volume :
1
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Protein structure prediction aims to determine the three-dimensional structure of proteins form their amino acid sequences. When a protein does not have similarity (homology) to any known fold, threading or fold recognition methods are used to predict structure. Fold recognition methods frequently employ secondary structure, solvent accessibility, and evolutionary information to enhance the accuracy and the quality of the predictions. In this paper, we present a residue based alignment method as an alternative to the state-of-the-art SSEA method, originally introduced by Przytycka et al., and further modified by McGuffin et al. We introduce a residue-based score function, which can incorporate amino acid similarity matrices such as BLOSUM into secondary structure similarity scoring and compute joint alignments. We show that the power of the SSEA method comes from the length normalization instead of the element alignment technique and similar performance can be achieved using residue-based alignments of secondary structures by optimizing gap costs. In simulations with the two benchmark datasets, our method performs slightly better than the SSEA in terms of the fold recognition accuracy. When the secondary structure similarity matrix is combined with the amino acid based BLOSUM30 matrix, the accuracy of our method improves further (4% for the McGuffin set and 10% for the Ding and Dubchak set). The availability of aligning the amino acid and secondary structure sequences in a joint manner offers a better starting point for more elaborate techniques that employ profile-profile alignments and machine learning methods.
Keywords :
medical image processing; pattern recognition; proteins; BLOSUM30 matrix; amino acid sequences; element alignment technique; machine learning methods; protein fold recognition; residue-based alignments; secondary structure; three-dimensional structure; Amino acids; Image processing; Image recognition; Learning systems; Libraries; Optimization methods; Protein engineering; Signal processing; Solvents; Target recognition; amino acid alignment; gap cost; protein fold recognition; score normalization; secondary structure alignment;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2007.366688
Filename :
4217088
Link To Document :
بازگشت