Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 5;52(D1):D434-D441.
doi: 10.1093/nar/gkad928.

DisProt in 2024: improving function annotation of intrinsically disordered proteins

Collaborators, Affiliations

DisProt in 2024: improving function annotation of intrinsically disordered proteins

Maria Cristina Aspromonte et al. Nucleic Acids Res. .

Erratum in

Abstract

DisProt (URL: https://disprot.org) is the gold standard database for intrinsically disordered proteins and regions, providing valuable information about their functions. The latest version of DisProt brings significant advancements, including a broader representation of functions and an enhanced curation process. These improvements aim to increase both the quality of annotations and their coverage at the sequence level. Higher coverage has been achieved by adopting additional evidence codes. Quality of annotations has been improved by systematically applying Minimum Information About Disorder Experiments (MIADE) principles and reporting all the details of the experimental setup that could potentially influence the structural state of a protein. The DisProt database now includes new thematic datasets and has expanded the adoption of Gene Ontology terms, resulting in an extensive functional repertoire which is automatically propagated to UniProtKB. Finally, we show that DisProt's curated annotations strongly correlate with disorder predictions inferred from AlphaFold2 pLDDT (predicted Local Distance Difference Test) confidence scores. This comparison highlights the utility of DisProt in explaining apparent uncertainty of certain well-defined predicted structures, which often correspond to folding-upon-binding fragments. Overall, DisProt serves as a comprehensive resource, combining experimental evidence of disorder information to enhance our understanding of intrinsically disordered proteins and their functional implications.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Comparison of the disorder content at the protein level in DisProt and AlphaFoldDB. The disorder content is calculated as the fraction of disordered residues over the protein sequence length. DisProt disorder content corresponds to the fraction of residues in the consensus, which includes structurally disordered regions. Only DisProt proteins with an AlphaFold structure covering the entire protein sequence in AlphaFoldDB were considered, n = 2356. (A) Correlation of the disorder content between DisProt and AlphaFold when different pLDDT thresholds are selected. (B) Comparison of the disorder content between DisProt and AlphaFold when the AlphaFold pLDDT < 70. The red dotted line represents the linear least-squares regression between the two dimensions, with slope 0.462 ± 0.021 and intercept 0.114 ± 0.009.
Figure 2.
Figure 2.
The number of DisProt proteins annotated with functional terms. The statistic is provided for the three Gene Ontology namespaces, as well as for the ‘Disorder function’ aspect from the IDPO ontology. The calculation considers only the first 15 most used annotation terms. Before the calculation, both GO and IDPO terms were propagated to the corresponding ontology root. Proteins with multiple identical annotations, e.g. when different articles report the same experimental evidence, are counted only once.

Similar articles

  • DisProt in 2022: improved quality and accessibility of protein intrinsic disorder annotation.
    Quaglia F, Mészáros B, Salladini E, Hatos A, Pancsa R, Chemes LB, Pajkos M, Lazar T, Peña-Díaz S, Santos J, Ács V, Farahi N, Fichó E, Aspromonte MC, Bassot C, Chasapi A, Davey NE, Davidović R, Dobson L, Elofsson A, Erdős G, Gaudet P, Giglio M, Glavina J, Iserte J, Iglesias V, Kálmán Z, Lambrughi M, Leonardi E, Longhi S, Macedo-Ribeiro S, Maiani E, Marchetti J, Marino-Buslje C, Mészáros A, Monzon AM, Minervini G, Nadendla S, Nilsson JF, Novotný M, Ouzounis CA, Palopoli N, Papaleo E, Pereira PJB, Pozzati G, Promponas VJ, Pujols J, Rocha ACS, Salas M, Sawicki LR, Schad E, Shenoy A, Szaniszló T, Tsirigos KD, Veljkovic N, Parisi G, Ventura S, Dosztányi Z, Tompa P, Tosatto SCE, Piovesan D. Quaglia F, et al. Nucleic Acids Res. 2022 Jan 7;50(D1):D480-D487. doi: 10.1093/nar/gkab1082. Nucleic Acids Res. 2022. PMID: 34850135 Free PMC article.
  • DisProt: intrinsic protein disorder annotation in 2020.
    Hatos A, Hajdu-Soltész B, Monzon AM, Palopoli N, Álvarez L, Aykac-Fas B, Bassot C, Benítez GI, Bevilacqua M, Chasapi A, Chemes L, Davey NE, Davidović R, Dunker AK, Elofsson A, Gobeill J, Foutel NSG, Sudha G, Guharoy M, Horvath T, Iglesias V, Kajava AV, Kovacs OP, Lamb J, Lambrughi M, Lazar T, Leclercq JY, Leonardi E, Macedo-Ribeiro S, Macossay-Castillo M, Maiani E, Manso JA, Marino-Buslje C, Martínez-Pérez E, Mészáros B, Mičetić I, Minervini G, Murvai N, Necci M, Ouzounis CA, Pajkos M, Paladin L, Pancsa R, Papaleo E, Parisi G, Pasche E, Barbosa Pereira PJ, Promponas VJ, Pujols J, Quaglia F, Ruch P, Salvatore M, Schad E, Szabo B, Szaniszló T, Tamana S, Tantos A, Veljkovic N, Ventura S, Vranken W, Dosztányi Z, Tompa P, Tosatto SCE, Piovesan D. Hatos A, et al. Nucleic Acids Res. 2020 Jan 8;48(D1):D269-D276. doi: 10.1093/nar/gkz975. Nucleic Acids Res. 2020. PMID: 31713636 Free PMC article.
  • Exploring Manually Curated Annotations of Intrinsically Disordered Proteins with DisProt.
    Quaglia F, Hatos A, Piovesan D, Tosatto SCE. Quaglia F, et al. Curr Protoc Bioinformatics. 2020 Dec;72(1):e107. doi: 10.1002/cpbi.107. Curr Protoc Bioinformatics. 2020. PMID: 33017101
  • Comparative Assessment of Intrinsic Disorder Predictions with a Focus on Protein and Nucleic Acid-Binding Proteins.
    Katuwawala A, Kurgan L. Katuwawala A, et al. Biomolecules. 2020 Dec 4;10(12):1636. doi: 10.3390/biom10121636. Biomolecules. 2020. PMID: 33291838 Free PMC article. Review.
  • Minimum information guidelines for experiments structurally characterizing intrinsically disordered protein regions.
    Mészáros B, Hatos A, Palopoli N, Quaglia F, Salladini E, Van Roey K, Arthanari H, Dosztányi Z, Felli IC, Fischer PD, Hoch JC, Jeffries CM, Longhi S, Maiani E, Orchard S, Pancsa R, Papaleo E, Pierattelli R, Piovesan D, Pritisanac I, Tenorio L, Viennet T, Tompa P, Vranken W, Tosatto SCE, Davey NE. Mészáros B, et al. Nat Methods. 2023 Sep;20(9):1291-1303. doi: 10.1038/s41592-023-01915-x. Epub 2023 Jul 3. Nat Methods. 2023. PMID: 37400558 Review.

Cited by

References

    1. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., Tunyasuvunakool K., Bates R., Žídek A., Potapenko A.et al. .. Highly accurate protein structure prediction with AlphaFold. Nature. 2021; 596:583–589. - PMC - PubMed
    1. Tompa P., Fersht A.. Structure and Function of Intrinsically Disordered Proteins. 2009; CRC Press.
    1. Porta-Pardo E., Ruiz-Serra V., Valentini S., Valencia A.. The structural coverage of the human proteome before and after AlphaFold. PLoS Comput. Biol. 2022; 18:e1009818. - PMC - PubMed
    1. Wright P.E., Dyson H.J.. Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 2015; 16:18–29. - PMC - PubMed
    1. Ruan H., Sun Q., Zhang W., Liu Y., Lai L.. Targeting intrinsically disordered proteins at the edge of chaos. Drug Discov. Today. 2019; 24:217–227. - PubMed

MeSH terms

Substances