Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Oct 2:7:64.
doi: 10.1186/1472-6807-7-64.

Antibody-protein interactions: benchmark datasets and prediction tools evaluation

Affiliations

Antibody-protein interactions: benchmark datasets and prediction tools evaluation

Julia V Ponomarenko et al. BMC Struct Biol. .

Abstract

Background: The ability to predict antibody binding sites (aka antigenic determinants or B-cell epitopes) for a given protein is a precursor to new vaccine design and diagnostics. Among the various methods of B-cell epitope identification X-ray crystallography is one of the most reliable methods. Using these experimental data computational methods exist for B-cell epitope prediction. As the number of structures of antibody-protein complexes grows, further interest in prediction methods using 3D structure is anticipated. This work aims to establish a benchmark for 3D structure-based epitope prediction methods.

Results: Two B-cell epitope benchmark datasets inferred from the 3D structures of antibody-protein complexes were defined. The first is a dataset of 62 representative 3D structures of protein antigens with inferred structural epitopes. The second is a dataset of 82 structures of antibody-protein complexes containing different structural epitopes. Using these datasets, eight web-servers developed for antibody and protein binding sites prediction have been evaluated. In no method did performance exceed a 40% precision and 46% recall. The values of the area under the receiver operating characteristic curve for the evaluated methods were about 0.6 for ConSurf, DiscoTope, and PPI-PRED methods and above 0.65 but not exceeding 0.70 for protein-protein docking methods when the best of the top ten models for the bound docking were considered; the remaining methods performed close to random. The benchmark datasets are included as a supplement to this paper.

Conclusion: It may be possible to improve epitope prediction methods through training on datasets which include only immune epitopes and through utilizing more features characterizing epitopes, for example, the evolutionary conservation score. Notwithstanding, overall poor performance may reflect the generality of antigenicity and hence the inability to decipher B-cell epitopes as an intrinsic feature of the protein. It is an open question as to whether ultimately discriminatory features can be found.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart for building benchmark datasets.
Figure 2
Figure 2
Hypothetical example of the structural alignment of proteins (A) (sequence AVCQYWC) and (B) (sequence ACYARTYC). Number of residue-residue matches = 5, number of residue-residue matches relative to the length the longest chain = 63% (5/8), sequence identity = 80% (4/5).
Figure 3
Figure 3
Two orthogonal views of a representative structure, influenza A virus hemagglutinin HA1 chain [PDB:1EO8]. Chain A is shown in light gray upon which are mapped epitope residues inferred from six protein structures in complexes with antibody fragments: HC45 Fab [PDB:1QFU] (blue), BH151 Fab [PDB:1EO8] (magenta), HC63 Fab [PDB:1KEN] (green), HC19 Fab [PDB:2VIR, 2VIS, 2VIT] (red). The hemagglutinin HA2 chain is shown in cyan. Residues common to HC45 and BH151 epitopes are shown in orange; residues common to HC63 and HC19 epitopes are shown in yellow; residue Tyr98 which is a part of HC19 epitope inferred from structure 2VIR but not from 2VIS and 2VIT structures is shown in black; The HC19 epitope residue Thr131 which is mutated to Ile in the 2VIS structure is shown in dark red. The HC19 epitope residue Thr155 which is mutated to Ile in 2VIT structure is shown in violet.
Figure 4
Figure 4
Distributions of DiscoTope scores for epitope, non-epitope and all protein residues.
Figure 5
Figure 5
Distribution of ConSurf scores for epitope and all protein residues. For the definition of confidence score see the Methods section.
Figure 6
Figure 6
Distribution of ProMate scores for epitope, non-epitope and all protein residues.
Figure 7
Figure 7
Distribution of PIER scores for epitope, non-epitope and all protein residues.
Figure 8
Figure 8
Average AUC values for each method. Vertical bars show one standard deviation.
Figure 9
Figure 9
Overall methods performance measured as average sensitivity and PPV values.
Figure 10
Figure 10
Proportion of successfully predicted epitopes.

Similar articles

Cited by

References

    1. Peters B, Sidney J, Bourne P, Bui HH, Buus S, Doh G, Fleri W, Kronenberg M, Kubo R, Lund O, Nemazee D, Ponomarenko JV, Sathiamurthy M, Schoenberger SP, Stewart S, Surko P, Way S, Wilson S, Sette A. The design and implementation of the immune epitope database and analysis resource. Immunogenetics. 2005;57:326–336. - PMC - PubMed
    1. Van Regenmortel MH. Immunoinformatics may lead to a reappraisal of the nature of B cell epitopes and of the feasibility of synthetic peptide vaccines. J Mol Recognit. 2006;19:183–187. - PubMed
    1. Gomara MJ, Haro I. Synthetic peptides for the immunodiagnosis of human diseases. Curr Med Chem. 2007;14:531–546. - PubMed
    1. Meloen RH, Puijk WC, Langeveld JP, Langedijk JP, Timmerman P. Design of synthetic peptides for diagnostics. Curr Protein Pept Sci. 2003;4:253–260. - PubMed
    1. Leinikki P, Lehtinen M, Hyoty H, Parkkonen P, Kantanen ML, Hakulinen J. Synthetic peptides as diagnostic tools in virology. Adv Virus Res. 1993;42:149–186. - PubMed

Publication types

MeSH terms