DocumentCode :
3180385
Title :
Classification of proteins in intracellular and secretory pathway using global descriptors of amino acid sequence
Author :
Govindan, Geetha ; Nair, Achuthsankar S.
Author_Institution :
Centre for Excellence in Bioinf., Univ. of Kerala, Thiruvananthapuram, India
fYear :
2011
fDate :
11-14 Dec. 2011
Firstpage :
160
Lastpage :
164
Abstract :
It is widely recognized that the information from the amino acid sequence can serve as crucial pointers in predicting subcellular location of proteins. We introduce a new feature vector for predicting proteins targeted to various compartments in the intracellular and secretory pathway from protein sequence. Features are based on the global Composition, Transition and Distribution (CTD) of amino acid attributes such as hydrophobicity, normalized van der Waals volume, polarity, polarizability, charge, secondary structure and solvent accessibility. Sequences are considered in three equal parts and the features are extracted separately for all the three parts. Based on the feature vectors, we have trained a Support Vector Machine to classify intracellular and secretory proteins. Our method gives an accuracy of 92% in human, 88% in plant and 95% in fungi with independent dataset at root level of the protein sorting pathway.
Keywords :
bioinformatics; cellular transport; feature extraction; hydrophobicity; molecular biophysics; pattern classification; polarisability; proteins; support vector machines; van der Waals forces; amino acid attribute; amino acid sequence; charge; feature extraction; feature vector; global descriptors; hydrophobicity; intracellular pathway; normalized van der Waals volume; polarity; polarizability; protein classification; protein prediction; protein sorting pathway; secondary structure; secretory pathway; solvent accessibility; subcellular protein location; support vector machine; Amino acids; Bioinformatics; Feature extraction; Proteins; Sorting; Support vector machine classification; Support Vector Machine classification; cellular sorting; composition; distribution; intracellular pathway; protein sorting pathway; protein subcellular localization; secretory pathway; sequence features; transition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies (WICT), 2011 World Congress on
Conference_Location :
Mumbai
Print_ISBN :
978-1-4673-0127-5
Type :
conf
DOI :
10.1109/WICT.2011.6141236
Filename :
6141236
Link To Document :
بازگشت