ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models
- PMID: 36828268
- PMCID: PMC10293049
- DOI: 10.1016/j.jmb.2023.168021
ModelCIF: An Extension of PDBx/mmCIF Data Representation for Computed Structure Models
Abstract
ModelCIF (github.com/ihmwg/ModelCIF) is a data information framework developed for and by computational structural biologists to enable delivery of Findable, Accessible, Interoperable, and Reusable (FAIR) data to users worldwide. ModelCIF describes the specific set of attributes and metadata associated with macromolecular structures modeled by solely computational methods and provides an extensible data representation for deposition, archiving, and public dissemination of predicted three-dimensional (3D) models of macromolecules. It is an extension of the Protein Data Bank Exchange / macromolecular Crystallographic Information Framework (PDBx/mmCIF), which is the global data standard for representing experimentally-determined 3D structures of macromolecules and associated metadata. The PDBx/mmCIF framework and its extensions (e.g., ModelCIF) are managed by the Worldwide Protein Data Bank partnership (wwPDB, wwpdb.org) in collaboration with relevant community stakeholders such as the wwPDB ModelCIF Working Group (wwpdb.org/task/modelcif). This semantically rich and extensible data framework for representing computed structure models (CSMs) accelerates the pace of scientific discovery. Herein, we describe the architecture, contents, and governance of ModelCIF, and tools and processes for maintaining and extending the data standard. Community tools and software libraries that support ModelCIF are also described.
Keywords: Computed Structure Models; Data Standard; ModelCIF; PDBx/mmCIF; Protein Structure Prediction.
Copyright © 2023 The Author(s). Published by Elsevier Ltd.. All rights reserved.
Conflict of interest statement
Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures
References
-
- Protein Data Bank. (1971). Crystallography: Protein Data Bank. Nature (London), New Biol. 233, 223–223. - PubMed
-
- Anfinsen CR. (1973). Principles that govern the folding of protein chains. Science. 181, 223–230. - PubMed
-
- Baker D, Sali A. (2001). Protein structure prediction and structural genomics. Science. 294, 93–96. - PubMed
-
- Gobel U, Sander C, Schneider R, Valencia A. (1994). Correlated mutations and residue contacts in proteins. Proteins. 18, 309–317. - PubMed
Publication types
MeSH terms
Substances
Grants and funding
- BB/S020071/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- R01 GM083960/GM/NIGMS NIH HHS/United States
- HHMI/Howard Hughes Medical Institute/United States
- R01 GM109046/GM/NIGMS NIH HHS/United States
- R01 GM133198/GM/NIGMS NIH HHS/United States
- BB/W017970/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BB/S020144/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- BB/V004247/1/BB_/Biotechnology and Biological Sciences Research Council/United Kingdom
- U01 GM093324/GM/NIGMS NIH HHS/United States
- P41 GM109824/GM/NIGMS NIH HHS/United States
LinkOut - more resources
Full Text Sources
