Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2021 Jan 6:2021:6690299.
doi: 10.1155/2021/6690299. eCollection 2021.

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection

Affiliations
Comparative Study

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection

Chenchen Ding et al. Comput Math Methods Med. .

Abstract

Identification of bacterial type III secreted effectors (T3SEs) has become a popular research topic in the field of bioinformatics due to its crucial role in understanding host-pathogen interaction and developing better therapeutic targets against the pathogens. However, the recognition of all effector proteins by using traditional experimental approaches is often time-consuming and laborious. Therefore, development of computational methods to accurately predict putative novel effectors is important in reducing the number of biological experiments for validation. In this study, we proposed a method, called iT3SE-PX, to identify T3SEs solely based on protein sequences. First, three kinds of features were extracted from the position-specific scoring matrix (PSSM) profiles to help train a machine learning (ML) model. Then, the extreme gradient boosting (XGBoost) algorithm was performed to rank these features based on their classification ability. Finally, the optimal features were selected as inputs to a support vector machine (SVM) classifier to predict T3SEs. Based on the two benchmark datasets, we conducted a 100-time randomized 5-fold cross validation (CV) and an independent test, respectively. The experimental results demonstrated that the proposed method achieved superior performance compared to most of the existing methods and could serve as a useful tool for identifying putative T3SEs, given only the sequence information.

PubMed Disclaimer

Conflict of interest statement

The authors declare that there is no conflict of interest regarding the publication of this paper.

Figures

Figure 1
Figure 1
System diagram of the proposed iT3SE-PX model.
Figure 2
Figure 2
This graph shows how different top K features affect the overall accuracies.
Figure 3
Figure 3
ROC curves of SVM, RF, and NB classifiers based on the 5-fold CV tests. The AUC values were calculated and shown in the inset.

Similar articles

Cited by

References

    1. Deng W., Marshall N. C., Rowland J. L., et al. Assembly, structure, function and regulation of type III secretion systems. Nature Reviews Microbiology. 2017;15(6):323–337. doi: 10.1038/nrmicro.2017.20. - DOI - PubMed
    1. Hu Y., Huang H., Cheng X., et al. A global survey of bacterial type III secretion systems and their effectors. Environmental Microbiology. 2017;19(10):3879–3895. doi: 10.1111/1462-2920.13755. - DOI - PubMed
    1. Arnold R., Brandmaier S., Kleine F., et al. Sequence-based prediction of type III secreted proteins. PLoS Pathogens. 2009;5(4, article e1000376) doi: 10.1371/journal.ppat.1000376. - DOI - PMC - PubMed
    1. Yang Y., Zhao J., Morgan R. L., Ma W., Jiang T. Computational prediction of type III secreted proteins from gram-negative bacteria. BMC Bioinformatics. 2010;11(Supplement 1):p. S47. doi: 10.1186/1471-2105-11-s1-s47. - DOI - PMC - PubMed
    1. Dong X., Lu X., Zhang Z. BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors. Database. 2015;2015, article bav064 doi: 10.1093/database/bav064. - DOI - PMC - PubMed

Substances

LinkOut - more resources