Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Aug 26.
doi: 10.1002/prot.70045. Online ahead of print.

Critical Assessment of Protein Intrinsic Disorder Round 3 - Predicting Disorder in the Era of Protein Language Models

Affiliations

Critical Assessment of Protein Intrinsic Disorder Round 3 - Predicting Disorder in the Era of Protein Language Models

Mahta Mehdiabadi et al. Proteins. .

Abstract

Intrinsic disorder (ID) in proteins is a complex phenomenon, encompassing a continuum from entirely disordered regions to structured domains with flexible segments. The absence of a ground truth for all forms of disorder, combined with the possibility of structural transitions between ordered and disordered states under specific conditions, makes accurate prediction of ID especially challenging. The Critical Assessment of Protein Intrinsic Disorder (CAID) evaluates ID prediction methods using diverse benchmarks derived from DisProt, a manually curated database of experimentally validated annotations. This paper presents findings from the third (CAID3), in which 24 new methods were assessed along with the predictors from previous rounds. Compared to CAID2, the top-performing methods in CAID3 demonstrated significant gains in average precision: over 31% improvement in predicting linker regions, and 15% in disorder prediction. This round introduces a new binding sub-challenge focused on identifying binding regions within known IDR boundaries. The results indicate that this task remains challenging, highlighting the potential for improvement. The top-performing methods in CAID3 are mostly new and commonly used embeddings from protein language models (pLMs), underscoring the growing impact of pLMs in tackling the complexities of disordered proteins and advancing ID prediction.

Keywords: CAID; DisProt; critical assessment; intrinsic disorder prediction; intrinsically disordered proteins.

PubMed Disclaimer

References

    1. R. van der Lee, M. Buljan, B. Lang, et al., “Classification of Intrinsically Disordered Regions and Proteins,” Chemical Reviews 114 (2014): 6589–6631.
    1. N. Lyle, R. K. Das, and R. V. Pappu, “A Quantitative Measure for Protein Conformational Heterogeneity,” Journal of Chemical Physics 139 (2013): 121907.
    1. M. Bonomi, G. T. Heller, C. Camilloni, and M. Vendruscolo, “Principles of Protein Structural Ensemble Determination,” Current Opinion in Structural Biology 42 (2017): 106–116.
    1. P. Sormanni, D. Piovesan, G. T. Heller, et al., “Simultaneous Quantification of Protein Order and Disorder,” Nature Chemical Biology 13 (2017): 339–342.
    1. H. J. Dyson and P. E. Wright, “Coupling of Folding and Binding for Unstructured Proteins,” Current Opinion in Structural Biology 12 (2002): 54–60.