Title :
Membrane protein prediction using wavelet decomposition and pseudo amino acid based feature extraction
Author :
Hayat, Maqsood ; Khan, Asifullah
Author_Institution :
Dept. of Comput. & Inf. Sci., Pakistan Inst. of Eng. & Appl. Sci., Islamabad, Pakistan
Abstract :
Membrane proteins play an important role in many biological processes and are attractive drug targets. In this study, membrane proteins are classified using two feature extraction and several classification strategies. The first feature extraction strategy is pseudo amino acid (PseAA) composition; utilizing hydrophobicity and hydrophilicity for reflecting the sequence order effects, while the second method is discrete wavelet analysis (DWT); analyzing the different components of a signal localized both in space and scale domains. The nearest neighbor, probabilistic neural network, support vector machine, random forest, and Adaboost are used as basic learning mechanisms. The predicted results of the base learners are combined using majority voting to form an ensemble classifier. The best accuracy obtained for the Jackknife and independent dataset test is 85.4% and 95.3%, respectively. Using performance measures such as MCC, Sensitivity, Specificity, and F-measure, it has been observed that PseAA based prediction is significantly higher than that of the DWT, and is also the best reported, so far.
Keywords :
biology computing; feature extraction; hydrophilicity; hydrophobicity; learning (artificial intelligence); neural nets; probability; proteins; support vector machines; Adaboost; Jackknife; feature extraction; hydrophilicity; hydrophobicity; learning mechanisms; membrane protein prediction; probabilistic neural network; pseudo amino acid composition; random forest; support vector machine; wavelet decomposition; Accuracy; Amino acids; Biomembranes; Discrete wavelet transforms; Feature extraction; Proteins; Support vector machines; Discrete Wavelet Transform (DWT); Ensemble Classifier; Neural Networks; Pseudo Amino Acid (PseAA) Composition;
Conference_Titel :
Emerging Technologies (ICET), 2010 6th International Conference on
Conference_Location :
Islamabad
Print_ISBN :
978-1-4244-8057-9
DOI :
10.1109/ICET.2010.5638392