Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 1;35(22):4862-4865.
doi: 10.1093/bioinformatics/btz422.

AlphaFold at CASP13

Affiliations

AlphaFold at CASP13

Mohammed AlQuraishi. Bioinformatics. .

Abstract

Computational prediction of protein structure from sequence is broadly viewed as a foundational problem of biochemistry and one of the most difficult challenges in bioinformatics. Once every two years the Critical Assessment of protein Structure Prediction (CASP) experiments are held to assess the state of the art in the field in a blind fashion, by presenting predictor groups with protein sequences whose structures have been solved but have not yet been made publicly available. The first CASP was organized in 1994, and the latest, CASP13, took place last December, when for the first time the industrial laboratory DeepMind entered the competition. DeepMind's entry, AlphaFold, placed first in the Free Modeling (FM) category, which assesses methods on their ability to predict novel protein folds (the Zhang group placed first in the Template-Based Modeling (TBM) category, which assess methods on predicting proteins whose folds are related to ones already in the Protein Data Bank.) DeepMind's success generated significant public interest. Their approach builds on two ideas developed in the academic community during the preceding decade: (i) the use of co-evolutionary analysis to map residue co-variation in protein sequence to physical contact in protein structure, and (ii) the application of deep neural networks to robustly identify patterns in protein sequence and co-evolutionary couplings and convert them into contact maps. In this Letter, we contextualize the significance of DeepMind's entry within the broader history of CASP, relate AlphaFold's methodological advances to prior work, and speculate on the future of this important problem.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Historical CASP performance in prediction of gross protein topology. Curves show the best and second best predictors at each CASP, while the dashed line shows the expected improvement at CASP13 given the average rate of improvement from CASP10 to 12. Ranking is based on CASP assessor’s formula, and does not always coincide with highest mean GDT_TS (e.g. CASP10). Error bars correspond to 95% confidence intervals
Fig. 2.
Fig. 2.
Historical CASP performance in prediction of fine-grained protein topology. Curves show the best and second best predictors at each CASP, while the dashed line shows the expected improvement at CASP13 given the average rate of improvement from CASP10 to 12. Ranking is based on CASP assessor’s formula, and does not always coincide with highest mean GDT_HA (e.g. CASP10). Error bars correspond to 95% confidence intervals

References

    1. Alley E. et al. (2019) Unified rational protein engineering with sequence-only deep representation learning. bioRxiv, 589333. - PMC - PubMed
    1. AlQuraishi M. (2019) End-to-end differentiable learning of protein structure. Cell Systems, 8, 292.e3–301.e3. - PMC - PubMed
    1. Anand N. et al. (2018) Generative modeling for protein structures In: Bengio S.et al. (eds) Advances in Neural Information Processing Systems 31. Curran Associates, Inc, Montreal, pp. 7505–7516.
    1. Bepler T., Berger B. (2018) Learning protein sequence embeddings using information from structure. ICLR 2019.
    1. Brünger A.T. et al. (1998) Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr, 54, 905–921. - PubMed

Publication types