Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 May 5:11:816.
doi: 10.3389/fimmu.2020.00816. eCollection 2020.

EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains

Affiliations

EpitoCore: Mining Conserved Epitope Vaccine Candidates in the Core Proteome of Multiple Bacteria Strains

Tayna S Fiuza et al. Front Immunol. .

Abstract

In reverse vaccinology approaches, complete proteomes of bacteria are submitted to multiple computational prediction steps in order to filter proteins that are possible vaccine candidates. Most available tools perform such analysis only in a single strain, or a very limited number of strains. But the vast amount of genomic data had shown that most bacteria contain pangenomes, i.e., their genomic information contains core, conserved genes, and random accessory genes specific to each strain. Therefore, in reverse vaccinology methods it is of the utmost importance to define core proteins and core epitopes. EpitoCore is a decision-tree pipeline developed to fulfill that need. It provides surfaceome prediction of proteins from related strains, defines core proteins within those, calculate their immunogenicity, predicts epitopes for a given set of MHC alleles defined by the user, and then reports if epitopes are located extracellularly and if they are conserved among the core homologs. Pipeline performance is illustrated by mining peptide vaccine candidates in Mycobacterium avium hominissuis strains. From a total proteome of ~4,800 proteins per strain, EpitoCore predicted 103 highly immunogenic core homologs located at cell surface, many of those related to virulence and drug resistance. Conserved epitopes identified among these homologs allows the users to define sets of peptides with potential to immunize the largest coverage of tested HLA alleles using peptide-based vaccines. Therefore, EpitoCore is able to provide automated identification of conserved epitopes in bacterial pangenomic datasets.

Keywords: bioinformatics; epitope prediction; pangenome; prokaryotes; reverse vaccinology; vaccine candidates.

PubMed Disclaimer

Figures

Figure 1
Figure 1
EpitoCore decision tree. Briefly, users retrieve fasta files for organism of interest, and perform transmembrane (TMHMM) and cellular localization (PSORT-B) predictions separately. TMHMM output is further divided into proteins with helix outside or not of the signal sequence. Homologs within surfaceome predictions are clustered with CMG Biotools. Core clusters had their epitopes ranked by IEDB, and that output is aligned to protein topology. Conserved epitopes in all strains are then reported. Numbers 1–7 in workflow show location where each in house script is called.
Figure 2
Figure 2
Distribution of homologs within clusters for TMHMM dataset. Proteins with membrane helix outside the signal peptide were clustered, and cluster composition was quantified. Majority of the clusters had, as expected, at least seven proteins, one from each of the analyzed strains, defining the core group. Clusters with six or less components are in principle defined as accessory proteins from the pangenome. Only core proteins were considered for immunogenic analysis.
Figure 3
Figure 3
True core proteins with discrepant surfaceome prediction. To evaluate if clustering could be biased when performed on a smaller dataset, we also executed CMG Biotools in the whole proteome dataset of all strains. Protein composition for most groups was identical regardless if clustering was performed before or after surfaceome prediction (zero missing proteins when clusters are compared) (A). For core proteins (column 7), 98% of the protein groups were identical. For accessory proteins (groups 1–6), this was closer to 55%. When hiding identical clusters (B), it is evident that for most accessory clusters, the number of missing elements adds to the exact number of strains used. This illustrates protein groups with 7 components if no prediction is performed, i.e., true core proteins.
Figure 4
Figure 4
Immunogenicity Scores for TMHMM dataset and definition of valid epitopes. (A) All sequences within each cluster were submitted to MHC II immunogenicity scoring, to all tested alleles. Sequences with percentile score lower than 0.05 were selected. Cluster immunogenicity is given by the mean distribution of all scored epitopes under this value, for all proteins. Highly immunogenic clusters were defined as those with a mean score lower than 0.02 (black dots). (B) Example of a homolog from cluster 71 (Cytochrome c subunit III protein). Immunogenicity prediction defined four regions of this protein to contain possible epitopes (Before panel, in pink). As shown in the figure, three of those regions are within the transmembrane domain. Protein topology alignment allowed the removal of such regions, which were then considered invalid epitopes (After panel). (C) Valid epitopes were then checked if they are conserved within all homologs of its cluster. Most clusters contained conserved identical epitopes in all homologs (count 7, pink). Epitopes present in 5 or less components of the cluster were not considered for further analysis.

Similar articles

Cited by

References

    1. Finlay BB, Mcfadden G. Anti-immunology: evasion of the host immune system by bacterial and viral pathogens. Cell. (2006) 124:767–82. 10.1016/j.cell.2006.01.034 - DOI - PubMed
    1. Tan S, Tan HT, Chung MC. Membrane proteins and membrane proteomics. Proteomics. (2008) 8:3924–32. 10.1002/pmic.200800597 - DOI - PubMed
    1. Talukdar S, Zutshi S, Prashanth KS, Saikia KK, Kumar P. Identification of potential vaccine candidates against Streptococcus pneumoniae by reverse vaccinology approach. Appl Biochem Biotechnol. (2014) 172:3026–41. 10.1007/s12010-014-0749-x - DOI - PMC - PubMed
    1. Nagpal G, Usmani SS, Raghava GPS. A web resource for designing subunit vaccine against major pathogenic species of bacteria. Front Immunol. (2018) 9:2280. 10.3389/fimmu.2018.02280 - DOI - PMC - PubMed
    1. Nosrati M, Hajizade A, Nazarian S, Amani J, Namvar Vansofla A, Tarverdizadeh Y. Designing a multi-epitope vaccine for cross-protection against Shigella spp: an immunoinformatics and structural vaccinology study. Mol Immunol. (2019) 116:106–16. 10.1016/j.molimm.2019.09.018 - DOI - PubMed

Publication types

Supplementary concepts