Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 11:16:11779322221095221.
doi: 10.1177/11779322221095221. eCollection 2022.

EpiBuilder: A Tool for Assembling, Searching, and Classifying B-Cell Epitopes

Affiliations

EpiBuilder: A Tool for Assembling, Searching, and Classifying B-Cell Epitopes

Renato Simões Moreira et al. Bioinform Biol Insights. .

Abstract

Epitopes are portions of a protein that are recognized by antibodies. These small amino acid sequences represent a significant breakthrough in a branch of bioinformatics called immunoinformatics. Various software are available for linear B-cell epitope (BCE) prediction such as ABCPred, SVMTrip, EpiDope, and EpitopeVec; a well-known BCE predictor is BepiPred-2.0. However, despite the prediction, there are several essential steps, such as epitope assembly, evaluation, and searching for epitopes in other proteomes. Here, we present EpiBuilder (https://epibuilder.sourceforge.io), a user friendly software that assists in epitope assembly, classifying and searching using input results of BepiPred-2.0. EpiBuilder generates several output results from these data and supports a proteome-wide processing approach. In addition, this software provides the following features: Chou & Fasman beta-turn prediction, Emini surface accessibility prediction, Karplus and Schulz flexibility prediction, Kolaskar and Tongaonkar antigenicity, Parker hydrophilicity prediction, N-glycosylation domains, and hydropathy. These information generate a unique topology for each epitope, visually demonstrating its characteristics. The software can search the entire epitope sequence in various FASTA files, and it allows to use BLASTP to identify epitopes that eventually have sequence variations. As an EpiBuilder application, we developed a epitope dataset from the protozoan Trypanosoma brucei gambiense, the gram-positive bacterium Clostridioides difficile, and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).

Keywords: BepiPred-2.0; Galaxy; Immunoinformatics.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests: The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure 1.
Figure 1.
Topology generated by EpiBuilder, the epitope is individually analyzed with visual presentation of its features, besides presenting the element All Matches, where the amino acid is above the cutoff point in all selected methods. The topology also presents the sites of N-glycosylation and also hydropathy.
Figure 2.
Figure 2.
The complete EpiBuilder pipeline, indicating input files (BepiPred-2.0 files and proteomes for query) and the internal process such as input loading and processing of software scores, searching for N-glycosylation, assembling of the topology, and the searching in proteomes. After all these steps, the reports are generated in several files. IEDB indicates Immune Epitope Database; MW, molecular mass; IP, Isoleletric Point.
Figure 3.
Figure 3.
EpiBuilder desktop main screen. From this screen, the user can configure the input file, threshold, the minimum and maximum size of epitopes, the additional features it intends to analyze, the proteomes for searching the epitopes, and the option to search proteomes by BLASTP.
Figure 4.
Figure 4.
EpiBuilder main screen on galaxy platform. From this screen, the user can configure the input file, threshold, the minimum and maximum size of epitopes, the additional features it intends to analyze, the proteomes for searching the epitopes, and the option to search proteomes by BLASTP.
Figure 5.
Figure 5.
Alignment of the spike proteins (S) of SARS-CoV-2, compared with the VoC variants where it is possible to analyze that there are mutations that impact amino acids present in identified areas epitopes, especially in the omicron variant. The alignment was performed using Clustal Omega with default parameter values. SARS-CoV-2 indicates severe acute respiratory syndrome coronavirus 2; VoC, variant of concern.

References

    1. Abbas A, Lichtman A, Pillai S. Cellular and Molecular Immunology. 9th ed. Amsterdam, The Netherlands: Elsevier; 2017.
    1. Shirai H, Prades C, Vita R, et al.. Antibody informatics for drug discovery. Biochim Biophys Acta. 2014;1844:2002-2015. doi:10.1016/j.bbapap.2014.07.006. - DOI - PubMed
    1. Manavalan B, Govindaraj RG, Shin TH, Kim MO, Lee G. iBCE-EL: a new ensemble learning framework for improved linear B-cell epitope prediction. Front Immunol. 2018;9:1695. - PMC - PubMed
    1. Ahammad I, Lira SS. Designing a novel mRNA vaccine against SARS-CoV-2: an immunoinformatics approach. Int J Biol Macromol. 2020;162:820-837. - PMC - PubMed
    1. Sadat SM, Aghadadeghi MR, Yousefi M, Khodaei A, Sadat Larijani M, Bahramali G. Bioinformatics analysis of SARS-CoV-2 to approach an effective vaccine candidate against COVID-19. Mol Biotechnol. 2021;63:389-409. doi:10.1007/s12033-021-00303-0. - DOI - PMC - PubMed