ModelCIF Update: Supporting Emerging Classes of Computational Macromolecular Models
- PMID: 41580068
- DOI: 10.1016/j.jmb.2026.169658
ModelCIF Update: Supporting Emerging Classes of Computational Macromolecular Models
Abstract
The recent development of highly accurate protein structure prediction tools has led to a rapid expansion in the scope of computational structural biology, enabling a much wider range of modelling studies than ever before. These new in silico opportunities help life science researchers understand how proteins interact with their environment and support design of new molecules with desired properties. Ultimately, they have broad applications, e.g. in medicine, drug discovery or engineering. To ensure reproducibility and to facilitate data exchange and reuse, predicted structures or computed structure models can be stored using ModelCIF, a rich data representation designed to include the atomic coordinates/metadata. The previously published version of ModelCIF (1.4.4; 2022-12-21) mainly covered protein structure predictions generated by homology and ab initio modelling. In this work, we present an extension of the ModelCIF (https://github.com/ihmwg/ModelCIF) data standard and its associated tools. This extension supports important new use cases, including modelling protein-ligand and protein-protein interactions, sampling multiple conformational states and designing proteins de novo. We define guidelines for storage and validation of modelling results for those use cases by applying new and existing ModelCIF categories to capture protocols, inputs and outputs. Additionally, we outline updates to the software tools and resources that implement these new standards and provide functionality for model generation, validation, archiving, and visualisation. By enabling consistent metadata capture across different modelling workflows, this framework aims to support the FAIR dissemination of computational models, thereby promoting reproducibility and reusability in downstream applications.
Keywords: conformational states; data standard; macromolecular structure prediction; protein complexes; protein design.
Copyright © 2026 The Authors. Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
LinkOut - more resources
Full Text Sources
