Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Nov;31(11):e4466.
doi: 10.1002/pro.4466.

Intrinsic protein disorder and conditional folding in AlphaFoldDB

Affiliations

Intrinsic protein disorder and conditional folding in AlphaFoldDB

Damiano Piovesan et al. Protein Sci. 2022 Nov.

Abstract

Intrinsically disordered regions (IDRs) defying the traditional protein structure-function paradigm have been difficult to analyze. The availability of accurate structure predictions on a large scale in AlphaFoldDB offers a fresh perspective on IDR prediction. Here, we establish three baselines for IDR prediction from AlphaFoldDB models based on the recent CAID dataset. Surprisingly, AlphaFoldDB is highly competitive for predicting both IDRs and conditionally folded binding regions, demonstrating the plasticity of the disorder to structure continuum.

Keywords: CAID; critical assessment; intrinsically disordered proteins; machine learning; protein structure; structural bioinformatics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

FIGURE 1
FIGURE 1
Example of intrinsically disordered regions and conditional folding predictions derived from AlphaFold and best predictors. The human Ephrin‐B2 protein (UniProt accession: P52799) is shown as a representative example to illustrate the overlap between AlphaFold predictions and various sequence features. Panel a, the structure of the protein predicted by AlphaFold and colored by pLDDT score (<50 orange, <70 yellow, <90 light blue, >90 dark blue). Residue labels indicate annotated region boundaries. Panel b, database annotations (DisProt DP01588, PDB, InterPro P52799) and predicted regions (AlphaFold‐pLDDT, AlphaFold‐RSA, AlphaFold‐Bind, fIDPnn, ANCHOR‐2 19 ). PDB annotation is generated by combining observed residues in different PDB experiments. Best predictors were selected based on their performance against DisProt and DisProt‐binding references. Annotated regions are shown colored according to the legend on top of panel b (i.e., disorder in red, binding in gold, structure in blue, other features in gray, while white regions correspond to no annotation. Per‐residue AlphaFold predictions are provided in Figure S1
FIGURE 2
FIGURE 2
Results for AlphaFold on the three main CAID categories. The results for the DisProt‐PDB (n = 646 proteins, panels a,b), DisProt (n = 646 proteins, panels c, d), and DisProt‐binding (n = 646 proteins, panels e, f) references are shown. Performance of predictors expressed as maximum F1‐Score across all thresholds (F max) (panels a, c, e) and AUC (panels b, d, f) for AlphaFold (colored), the top 10 best ranking methods (gray) and baselines (white symbols) are shown. The legend on the right of each panel shows the name of the method alongside its F max or AUC score (f and a, respectively) and coverage (c). Notice how the latter is usually 1.0 for most predictors, but only 0.76 for AlphaFold as predictions for some targets are not available

References

    1. Kryshtafovych A, Schwede T, Topf M, Fidelis K, Moult J. Critical assessment of methods of protein structure prediction (CASP)—Round XIV. Proteins Struct Funct Bioinform. 2021;89:1607–1617. - PMC - PubMed
    1. Pereira J, Simpkin AJ, Hartmann MD, Rigden DJ, Keegan RM, Lupas AN. High‐accuracy protein structure prediction in CASP14. Proteins. 2021;89:1687–1699. - PubMed
    1. Jumper J, Evans R, Pritzel A, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. - PMC - PubMed
    1. Subramaniam S, Kleywegt GJ. A paradigm shift in structural biology. Nat Methods. 2022;19:20–23. - PubMed
    1. Cramer P. AlphaFold2 and the future of structural biology. Nat Struct Mol Biol. 2021;28:704–705. - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources