Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 5;52(D1):D434-D441.
doi: 10.1093/nar/gkad928.

DisProt in 2024: improving function annotation of intrinsically disordered proteins

Collaborators, Affiliations

DisProt in 2024: improving function annotation of intrinsically disordered proteins

Maria Cristina Aspromonte et al. Nucleic Acids Res. .

Erratum in

Abstract

DisProt (URL: https://disprot.org) is the gold standard database for intrinsically disordered proteins and regions, providing valuable information about their functions. The latest version of DisProt brings significant advancements, including a broader representation of functions and an enhanced curation process. These improvements aim to increase both the quality of annotations and their coverage at the sequence level. Higher coverage has been achieved by adopting additional evidence codes. Quality of annotations has been improved by systematically applying Minimum Information About Disorder Experiments (MIADE) principles and reporting all the details of the experimental setup that could potentially influence the structural state of a protein. The DisProt database now includes new thematic datasets and has expanded the adoption of Gene Ontology terms, resulting in an extensive functional repertoire which is automatically propagated to UniProtKB. Finally, we show that DisProt's curated annotations strongly correlate with disorder predictions inferred from AlphaFold2 pLDDT (predicted Local Distance Difference Test) confidence scores. This comparison highlights the utility of DisProt in explaining apparent uncertainty of certain well-defined predicted structures, which often correspond to folding-upon-binding fragments. Overall, DisProt serves as a comprehensive resource, combining experimental evidence of disorder information to enhance our understanding of intrinsically disordered proteins and their functional implications.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Comparison of the disorder content at the protein level in DisProt and AlphaFoldDB. The disorder content is calculated as the fraction of disordered residues over the protein sequence length. DisProt disorder content corresponds to the fraction of residues in the consensus, which includes structurally disordered regions. Only DisProt proteins with an AlphaFold structure covering the entire protein sequence in AlphaFoldDB were considered, n = 2356. (A) Correlation of the disorder content between DisProt and AlphaFold when different pLDDT thresholds are selected. (B) Comparison of the disorder content between DisProt and AlphaFold when the AlphaFold pLDDT < 70. The red dotted line represents the linear least-squares regression between the two dimensions, with slope 0.462 ± 0.021 and intercept 0.114 ± 0.009.
Figure 2.
Figure 2.
The number of DisProt proteins annotated with functional terms. The statistic is provided for the three Gene Ontology namespaces, as well as for the ‘Disorder function’ aspect from the IDPO ontology. The calculation considers only the first 15 most used annotation terms. Before the calculation, both GO and IDPO terms were propagated to the corresponding ontology root. Proteins with multiple identical annotations, e.g. when different articles report the same experimental evidence, are counted only once.

References

    1. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed
    1. Tompa P., Fersht A.. Structure and Function of Intrinsically Disordered Proteins. 2009; CRC Press.
    1. Porta-Pardo E., Ruiz-Serra V., Valentini S., Valencia A.. The structural coverage of the human proteome before and after AlphaFold. PLoS Comput. Biol. 2022; 18:e1009818. - PMC - PubMed
    1. Wright P.E., Dyson H.J.. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 2015; 16:18–29. - PMC - PubMed
    1. Ruan H., Sun Q., Zhang W., Liu Y., Lai L.. Targeting intrinsically disordered proteins at the edge of chaos. Drug Discov. Today. 2019; 24:217–227. - PubMed

MeSH terms

Substances