Title :
Fusion of cleavage site detection and pairwise alignment for fast subcellular localization
Author :
Mak, Man-Wai ; Kung, Sun-Yuan
Author_Institution :
Dept. of Electron. & Inf. Eng., Hong Kong Polytech. Univ., Hong Kong
Date :
March 31, 2008 to April 4, 2008
Abstract :
In recent years, both homology-based and signal-based methods have been proposed for predicting the subcellular localization of proteins. Although homology-based methods are known to detect more subcellular locations than signal-based methods, they generally require far more computational resources during both training and prediction, and the problem becomes intractable when annotating large databases. One possible solution is to reduce the sequence length. This paper proposes using the cleavage sites detected by signal-based methods (e.g., TargetP) to extract the sequence or profile segments that contain the most localization information, and aligning only those segments. The method was found to reduce the computation time of full-length alignment by 27-fold at a cost of only an 8% reduction in prediction accuracy. It can also increase the accuracy by 0.8% while reducing the computation time by 41%. The results further show that cutting the sequences at the cleavage sites detected by TargetP is better than cutting them at a fixed position.
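The idea in the abstract, cutting each sequence at a predicted cleavage site before aligning, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the `margin` parameter, and the scoring values are hypothetical, the cleavage position is passed in by hand rather than obtained from TargetP, and a plain global alignment score stands in for whatever sequence or profile alignment the paper actually uses.

```python
def truncate_at_cleavage(seq, cleavage_site, margin=0):
    """Keep the N-terminal segment up to a cleavage site, plus an optional margin.

    `cleavage_site` is assumed to be the number of residues before the cut,
    supplied by some external predictor (hypothetical input, not a TargetP call).
    """
    end = min(len(seq), max(0, cleavage_site) + margin)
    return seq[:end]


def alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Global (Needleman-Wunsch) alignment score with a linear gap penalty.

    Only two rows of the dynamic-programming table are kept, so memory is
    O(len(b)) while time stays O(len(a) * len(b)).
    """
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            curr.append(max(diag, prev[j] + gap, curr[j - 1] + gap))
        prev = curr
    return prev[-1]


def dp_cells(a, b):
    """Number of table cells filled, a rough proxy for alignment cost."""
    return len(a) * len(b)


if __name__ == "__main__":
    # Toy sequences and cut positions, invented purely for illustration.
    seq_a = "MLSRAVCGTSRQLAPALGYLGSRQKHSLPDLPYDYGALEPH"
    seq_b = "MLRAALSTARRGPRLSRLLSAAATSAVPAPNQQPEVFCNQI"
    cut_a, cut_b = 24, 22

    short_a = truncate_at_cleavage(seq_a, cut_a)
    short_b = truncate_at_cleavage(seq_b, cut_b)

    print("full-length score:", alignment_score(seq_a, seq_b))
    print("truncated score:  ", alignment_score(short_a, short_b))
    print("cost ratio:       ", dp_cells(seq_a, seq_b) / dp_cells(short_a, short_b))
```

Because pairwise alignment cost grows with the product of the two sequence lengths, shortening both sequences reduces the work roughly quadratically, which is consistent with the kind of speed-up the abstract reports, although the figures there come from the paper's own experiments and not from this sketch.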
Keywords :
feature extraction; medical signal detection; proteins; cleavage site detection; fast subcellular localization; full-length alignment; fusion method; homology-based methods; pairwise alignment; protein subcellular localization; signal-based methods; Amino acids; Biomembranes; Cells (biology); Costs; Data mining; Databases; Protein engineering; Sequences; Signal detection; Sorting; Pairwise alignment; TargetP; cleavage sites; profile; protein sequences; subcellular localization
Conference_Title :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2008.4517674