Effective prediction of bacterial type IV secreted effectors by combined features of both C-termini and N-termini
- PMID: 29127583
- DOI: 10.1007/s10822-017-0080-z
Effective prediction of bacterial type IV secreted effectors by combined features of both C-termini and N-termini
Abstract
Various bacterial pathogens can deliver their secreted substrates also called as effectors through type IV secretion systems (T4SSs) into host cells and cause diseases. Since T4SS secreted effectors (T4SEs) play important roles in pathogen-host interactions, identifying them is crucial to our understanding of the pathogenic mechanisms of T4SSs. A few computational methods using machine learning algorithms for T4SEs prediction have been developed by using features of C-terminal residues. However, recent studies have shown that targeting information can also be encoded in the N-terminal region of at least some T4SEs. In this study, we present an effective method for T4SEs prediction by novelly integrating both N-terminal and C-terminal sequence information. First, we collected a comprehensive dataset across multiple bacterial species of known T4SEs and non-T4SEs from literatures. Then, three types of distinctive features, namely amino acid composition, composition, transition and distribution and position-specific scoring matrices were calculated for 50 N-terminal and 100 C-terminal residues. After that, we employed information gain represent to rank the importance score of the 150 different position residues for T4SE secretion signaling. At last, 125 distinctive position residues were singled out for the prediction model to classify T4SEs and non-T4SEs. The support vector machine model yields a high receiver operating curve of 0.916 in the fivefold cross-validation and an accuracy of 85.29% for the independent test set.
Keywords: Effector; Machine learning; Sequence analysis; Type IV secretion system.
Similar articles
-
PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method.Front Microbiol. 2018 Oct 26;9:2571. doi: 10.3389/fmicb.2018.02571. eCollection 2018. Front Microbiol. 2018. PMID: 30416498 Free PMC article.
-
Systematic analysis and prediction of type IV secreted effector proteins by machine learning approaches.Brief Bioinform. 2019 May 21;20(3):931-951. doi: 10.1093/bib/bbx164. Brief Bioinform. 2019. PMID: 29186295 Free PMC article.
-
T4SEpp: A pipeline integrating protein language models to predict bacterial type IV secreted effectors.Comput Struct Biotechnol J. 2024 Jan 23;23:801-812. doi: 10.1016/j.csbj.2024.01.015. eCollection 2024 Dec. Comput Struct Biotechnol J. 2024. PMID: 38328004 Free PMC article.
-
Computational prediction of secretion systems and secretomes of Brucella: identification of novel type IV effectors and their interaction with the host.Mol Biosyst. 2016 Jan;12(1):178-90. doi: 10.1039/c5mb00607d. Epub 2015 Nov 17. Mol Biosyst. 2016. PMID: 26575364
-
Features and algorithms: facilitating investigation of secreted effectors in Gram-negative bacteria.Trends Microbiol. 2023 Nov;31(11):1162-1178. doi: 10.1016/j.tim.2023.05.011. Epub 2023 Jun 20. Trends Microbiol. 2023. PMID: 37349207 Review.
Cited by
-
Protein-Specific Prediction of RNA-Binding Sites Based on Information Entropy.Comput Intell Neurosci. 2022 Oct 3;2022:8626628. doi: 10.1155/2022/8626628. eCollection 2022. Comput Intell Neurosci. 2022. PMID: 36225547 Free PMC article.
-
iT4SE-EP: Accurate Identification of Bacterial Type IV Secreted Effectors by Exploring Evolutionary Features from Two PSI-BLAST Profiles.Molecules. 2021 Apr 24;26(9):2487. doi: 10.3390/molecules26092487. Molecules. 2021. PMID: 33923273 Free PMC article.
-
DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors.Front Microbiol. 2021 Jan 21;12:605782. doi: 10.3389/fmicb.2021.605782. eCollection 2021. Front Microbiol. 2021. PMID: 33552038 Free PMC article.
-
T4SE-XGB: Interpretable Sequence-Based Prediction of Type IV Secreted Effectors Using eXtreme Gradient Boosting Algorithm.Front Microbiol. 2020 Sep 24;11:580382. doi: 10.3389/fmicb.2020.580382. eCollection 2020. Front Microbiol. 2020. PMID: 33072049 Free PMC article.
-
PredT4SE-Stack: Prediction of Bacterial Type IV Secreted Effectors From Protein Sequences Using a Stacked Ensemble Method.Front Microbiol. 2018 Oct 26;9:2571. doi: 10.3389/fmicb.2018.02571. eCollection 2018. Front Microbiol. 2018. PMID: 30416498 Free PMC article.
References
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources