Critical Assessment of Protein Intrinsic Disorder Round 3 - Predicting Disorder in the Era of Protein Language Models
- PMID: 40859602
- DOI: 10.1002/prot.70045
Critical Assessment of Protein Intrinsic Disorder Round 3 - Predicting Disorder in the Era of Protein Language Models
Abstract
Intrinsic disorder (ID) in proteins is a complex phenomenon, encompassing a continuum from entirely disordered regions to structured domains with flexible segments. The absence of a ground truth for all forms of disorder, combined with the possibility of structural transitions between ordered and disordered states under specific conditions, makes accurate prediction of ID especially challenging. The Critical Assessment of Protein Intrinsic Disorder (CAID) evaluates ID prediction methods using diverse benchmarks derived from DisProt, a manually curated database of experimentally validated annotations. This paper presents findings from the third (CAID3), in which 24 new methods were assessed along with the predictors from previous rounds. Compared to CAID2, the top-performing methods in CAID3 demonstrated significant gains in average precision: over 31% improvement in predicting linker regions, and 15% in disorder prediction. This round introduces a new binding sub-challenge focused on identifying binding regions within known IDR boundaries. The results indicate that this task remains challenging, highlighting the potential for improvement. The top-performing methods in CAID3 are mostly new and commonly used embeddings from protein language models (pLMs), underscoring the growing impact of pLMs in tackling the complexities of disordered proteins and advancing ID prediction.
Keywords: CAID; DisProt; critical assessment; intrinsic disorder prediction; intrinsically disordered proteins.
© 2025 The Author(s). PROTEINS: Structure, Function, and Bioinformatics published by Wiley Periodicals LLC.
References
-
- R. van der Lee, M. Buljan, B. Lang, et al., “Classification of Intrinsically Disordered Regions and Proteins,” Chemical Reviews 114 (2014): 6589–6631.
-
- N. Lyle, R. K. Das, and R. V. Pappu, “A Quantitative Measure for Protein Conformational Heterogeneity,” Journal of Chemical Physics 139 (2013): 121907.
-
- M. Bonomi, G. T. Heller, C. Camilloni, and M. Vendruscolo, “Principles of Protein Structural Ensemble Determination,” Current Opinion in Structural Biology 42 (2017): 106–116.
-
- P. Sormanni, D. Piovesan, G. T. Heller, et al., “Simultaneous Quantification of Protein Order and Disorder,” Nature Chemical Biology 13 (2017): 339–342.
-
- H. J. Dyson and P. E. Wright, “Coupling of Folding and Binding for Unstructured Proteins,” Current Opinion in Structural Biology 12 (2002): 54–60.