Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug 25;18(8):e0290538.
doi: 10.1371/journal.pone.0290538. eCollection 2023.

TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus

Affiliations

TROLLOPE: A novel sequence-based stacked approach for the accelerated discovery of linear T-cell epitopes of hepatitis C virus

Phasit Charoenkwan et al. PLoS One. .

Abstract

Hepatitis C virus (HCV) infection is a concerning health issue that causes chronic liver diseases. Despite many successful therapeutic outcomes, no effective HCV vaccines are currently available. Focusing on T cell activity, the primary effector for HCV clearance, T cell epitopes of HCV (TCE-HCV) are considered promising elements to accelerate HCV vaccine efficacy. Thus, accurate and rapid identification of TCE-HCVs is recommended to obtain more efficient therapy for chronic HCV infection. In this study, a novel sequence-based stacked approach, termed TROLLOPE, is proposed to accurately identify TCE-HCVs from sequence information. Specifically, we employed 12 different sequence-based feature descriptors from heterogeneous perspectives, such as physicochemical properties, composition-transition-distribution information and composition information. These descriptors were used in cooperation with 12 popular machine learning (ML) algorithms to create 144 base-classifiers. To maximize the utility of these base-classifiers, we used a feature selection strategy to determine a collection of potential base-classifiers and integrated them to develop the meta-classifier. Comprehensive experiments based on both cross-validation and independent tests demonstrated the superior predictive performance of TROLLOPE compared with conventional ML classifiers, with cross-validation and independent test accuracies of 0.745 and 0.747, respectively. Finally, a user-friendly online web server of TROLLOPE (http://pmlabqsar.pythonanywhere.com/TROLLOPE) has been developed to serve research efforts in the large-scale identification of potential TCE-HCVs for follow-up experimental verification.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. System flowchart of the proposed TROLLOPE.
The development and performance assessment of TROLLOPE involves five main steps: dataset preparation, feature representation, stacked model development, performance evaluation, and online web server deployment.
Fig 2
Fig 2. Performance evaluations of top-30 base-classifiers.
(A-B) Cross-validation AUC and ACC of top-30 base-classifiers. (C-D) Independent AUC and ACC of top-30 base-classifiers.
Fig 3
Fig 3
Performance comparison of TROLLOPE and top-five base-classifiers on the training (A–B) and independent (C–D) datasets.
Fig 4
Fig 4. t-distributed stochastic neighbor embedding (t-SNE) distribution of positive and negative samples on the training dataset, where TCE-HCV and non-TCE-HCV are represented with red and blue dots, respectively.
TROLLOPE (A) and top-five base-classifiers (B-F).
Fig 5
Fig 5
t-SNE plots of our new feature OPF (A) and top-three feature descriptors (B-D) (i.e. DDE, DPC, and TPC) on the training dataset.
Fig 6
Fig 6. Feature importance analysis for TROLLOPE prediction.
(A) Scatter plot of top-15 informative probabilistic features. (B) The average absolute SHAP values of top-15 informative probabilistic features.
Fig 7
Fig 7
Feature importance analysis for XGB-AAI (A-B) and XGB-PCP (C-D) predictions. (A, C) Scatter plot of top 20 informative features. (B, D) The average absolute SHAP values of top 20 informative features.

References

    1. Manns M. P. et al. , "Hepatitis C virus infection," Nat Rev Dis Primers, vol. 3, p. 17006, Mar 2 2017. doi: 10.1038/nrdp.2017.6 - DOI - PubMed
    1. Pol S. and Lagaye S., "The remarkable history of the hepatitis C virus," Genes Immun, vol. 20, no. 5, pp. 436–446, May 2019. doi: 10.1038/s41435-019-0066-z - DOI - PubMed
    1. Thomas E., Ghany M. G., and Liang T. J., "The application and mechanism of action of ribavirin in therapy of hepatitis C," Antivir Chem Chemother, vol. 23, no. 1, pp. 1–12, Sep 25 2012. doi: 10.3851/IMP2125 - DOI - PMC - PubMed
    1. Dustin L. B., Bartolini B., Capobianchi M. R., and Pistello M., "Hepatitis C virus: life cycle in cells, infection and host response, and analysis of molecular markers influencing the outcome of infection and response to therapy," Clin Microbiol Infect, vol. 22, no. 10, pp. 826–832, Oct 2016. doi: 10.1016/j.cmi.2016.08.025 - DOI - PMC - PubMed
    1. Bhattacharjee C., Singh M., Das D., Chaudhuri S., and Mukhopadhyay A., "Current therapeutics against HCV," Virusdisease, vol. 32, no. 2, pp. 228–243, Jun 2021. doi: 10.1007/s13337-021-00697-0 - DOI - PMC - PubMed

Publication types

Substances