Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Nov;15(11):2558-67.
doi: 10.1110/ps.062405906. Epub 2006 Sep 25.

Prediction of residues in discontinuous B-cell epitopes using protein 3D structures

Affiliations

Prediction of residues in discontinuous B-cell epitopes using protein 3D structures

Pernille Haste Andersen et al. Protein Sci. 2006 Nov.

Abstract

Discovery of discontinuous B-cell epitopes is a major challenge in vaccine design. Previous epitope prediction methods have mostly been based on protein sequences and are not very effective. Here, we present DiscoTope, a novel method for discontinuous epitope prediction that uses protein three-dimensional structural data. The method is based on amino acid statistics, spatial information, and surface accessibility in a compiled data set of discontinuous epitopes determined by X-ray crystallography of antibody/antigen protein complexes. DiscoTope is the first method to focus explicitly on discontinuous epitopes. We show that the new structure-based method has a better performance for predicting residues of discontinuous epitopes than methods based solely on sequence information, and that it can successfully predict epitope residues that have been identified by different techniques. DiscoTope detects 15.5% of residues located in discontinuous epitopes with a specificity of 95%. At this level of specificity, the conventional Parker hydrophilicity scale for predicting linear B-cell epitopes identifies only 11.0% of residues located in discontinuous epitopes. Predictions by the DiscoTope method can guide experimental epitope mapping in both rational vaccine design and development of diagnostic tools, and may lead to more efficient epitope identification.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Analysis of the complete data set of discontinuous B-cell epitopes. (A) Distribution of the number of residues per epitope. (B) Distribution of the number of residues per sequential stretch of epitopes. (C) Distribution of the maximum length of a sequential stretch per epitope.
Figure 2.
Figure 2.
Contact numbers of epitope residues in the data set compared to nonepitope residues. The curves show the distribution of contact numbers for epitope residues (red curve) compared to nonepitope residues (black curve). The vertical lines represent the mean value of contact numbers for the epitope residues (red line) and for the nonepitope residues (black line).
Figure 3.
Figure 3.
Evaluation of B-cell epitope prediction methods. The average AUC of various methods on the five evaluation sets. “Log e-ne” denotes raw log-odds ratios; “Parker” denotes the Parker hydrophilicity scale; “Win9 log e-ne” denotes the log-odds ratios used with a smoothing window of nine residues; “Contact” denotes contact numbers; and “Naccess” denotes NACCESS RSA values. (A) The performance of single methods. (B) The performance of simple combination methods using contact numbers. (C) The performance of simple combination methods using NACCESS RSA values. (D) The performance of the structural proximity sum methods.
Figure 4.
Figure 4.
Dot plots showing comparisons of performances of the Parker method, the NACCESS RSA method, and the DiscoTope method. Circles indicate average AUC per group showed for the 25 groups of different antigens. The dotted lines indicate points where the methods perform equally.
Figure 5.
Figure 5.
Structure of the 1AR1 antigen. The antigen is a subunit of the cytochrome c oxidase (Ostermeier et al. 1997). (A) The 30% of residues with lowest contact numbers are shown in green. In red is shown a residue that is part of the 30% with lowest contact numbers and of the epitope from the data set. The rest of the epitope is shown in blue. (B) The structure mentioned above and rotated 90 degrees. (C) The complex of the cytochrome c oxidase with antibody fragments. The 1AR1 antigen is color coded as in A and B. Antibody fragments are shown in light blue. The other subunit of the cytochrome c oxidase is shown in yellow. Membrane spanning helices of the 1AR1 antigen are part of the lower half of the structure.
Figure 6.
Figure 6.
Predicted epitope residues of the AMA1 ectodomain. Backbone atoms of residues predicted by DiscoTope as parts of epitopes are highlighted in green. Side chains of the residues mapped to the monoclonal antibodies 1F9 and 4G2 are shown in black.

References

    1. Alix, A.J. 1999. Predictive estimation of protein linear epitopes by using the program PEOPLE. Vaccine 18 311–314. - PubMed
    1. Allcorn, L.C. and Martin, A.C. 2002. SACS–Self-maintaining database of antibody crystal structure information. Bioinformatics 18 175–181. - PubMed
    1. Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D.J. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25 3389–3402. - PMC - PubMed
    1. Bai, T., Becker, M., Gupta, A., Strike, P., Murphy, V.J., Anders, R.F., and Batchelor, A.H. 2005. Structure of AMA1 from Plasmodium falciparum reveals a clustering of polymorphisms that surround a conserved hydrophobic pocket. Proc. Natl. Acad. Sci. 102 12736–12741. - PMC - PubMed
    1. Barlow, D.J., Edwards, M.S., and Thornton, J.M. 1986. Continuous and discontinuous protein antigenic determinants. Nature 322 747–748. - PubMed

Publication types

MeSH terms

LinkOut - more resources