Title of article :
iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection
Author/Authors :
Ding, Chenchen Shanghai Ocean University - Shanghai, China , Han, Haitao Shanghai Ocean University - Shanghai, China , Li, Qianyue Shanghai Ocean University - Shanghai, China , Yang, Xiaoxia Shanghai Ocean University - Shanghai, China , Liu, Taigang Shanghai Ocean University - Shanghai, China
Pages :
8
From page :
1
To page :
8
Abstract :
Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effector proteins by using traditional experimental approaches is often time-consuming and laborious. Therefore, development of computational methods to accurately predict putative novel effectors is important in reducing the number of biological experiments for validation. In this study, we proposed a method, called iT3SE-PX, to identify T3SEs solely based on protein sequences. First, three kinds of features were extracted from the position-specific scoring matrix (PSSM) profiles to help train a machine learning (ML) model. Then, the extreme gradient boosting (XGBoost) algorithm was performed to rank these features based on their classification ability. Finally, the optimal features were selected as inputs to a support vector machine (SVM) classifier to predict T3SEs. Based on the two benchmark datasets, we conducted a 100-time randomized 5-fold cross validation (CV) and an independent test, respectively. The experimental results demonstrated that the proposed method achieved superior performance compared to most of the existing methods and could serve as a useful tool for identifying putative T3SEs, given only the sequence information.
Keywords :
iT3SE-PX , XGBoost , PSSM , Gram-negative
Journal title :
Computational and Mathematical Methods in Medicine
Serial Year :
2021
Full Text URL :
Record number :
2616242
Link To Document :
بازگشت