Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jan 7;50(D1):D534-D542.
doi: 10.1093/nar/gkab988.

PDBe-KB: collaboratively defining the biological context of structural data

Collaborators

PDBe-KB: collaboratively defining the biological context of structural data

PDBe-KB consortium. Nucleic Acids Res. .

Abstract

The Protein Data Bank in Europe - Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Schematic overview of the PDBe-KB infrastructure. PDBe-KB partner resources convert their annotations to a predefined JSON format and transfer these file sets via FTP. Weekly data validation and integration processes parse and load the annotations into the PDBe graph database. A rich set of API endpoints expose the data and power the PDBe-KB aggregated views. Researchers can access the data by setting up a local instance of the graph database, using the API endpoints, or visiting the aggregated view pages.
Figure 2.
Figure 2.
Superposition of protein chains and ligand molecules. The aggregated views of proteins provide access to superposed protein segments and offer a display mode that overlays all the observed ligands on representative chains from superposition clusters. The figure displays the ligand superposition of all the available small molecules in PDB structures of the 3C-like proteinase nsp5 of SARS-CoV-2.
Figure 3.
Figure 3.
Bulk data download service. The aggregated views of proteins provide a graphical user interface to a new bulk data download service which enables researchers to download all the coordinates, sequences and validation data available for a protein of interest.
Figure 4.
Figure 4.
Processed proteins section. The aggregated views of proteins now include a section highlighting all the mature, processed proteins for a polyprotein. In addition, users can click on the green boxes to view the 3D structures using Mol*, and they can navigate to dedicated processed proteins pages by clicking on the ‘view page’ button.
Figure 5.
Figure 5.
Predicted models of a protein of interest. The aggregated views of proteins now provide an overview of available predicted models from data resources such as AlphaFold DB and SWISS-MODEL.
Figure 6.
Figure 6.
Ligand annotations. The aggregated views of proteins now display annotations for ligand molecules based on a cofactor data pipeline. Similarly, we annotate peptides and antibodies in the macromolecular interactions section.

References

    1. Lee D., Redfern O., Orengo C.. Predicting protein function from sequence and structure. Nat. Rev. Mol. Cell Biol. 2007; 8:995–1005. - PubMed
    1. Waman V.P., Sen N., Varadi M., Daina A., Wodak S.J., Zoete V., Velankar S., Orengo C.. The impact of structural bioinformatics tools and resources on SARS-CoV-2 research and therapeutic strategies. Brief. Bioinform. 2021; 22:742–768. - PMC - PubMed
    1. Knott B.C., Erickson E., Allen M.D., Gado J.E., Graham R., Kearns F.L., Pardo I., Topuzlu E., Anderson J.J., Austin H.P.et al. .. Characterization and engineering of a two-enzyme system for plastics depolymerization. Proc. Natl. Acad. Sci. U.S.A. 2020; 117:25476–25485. - PMC - PubMed
    1. Batool M., Ahmad B., Choi S.. A structure-based drug discovery paradigm. Int. J. Mol. Sci. 2019; 20:2783. - PMC - PubMed
    1. Marques S.M., Planas-Iglesias J., Damborsky J.. Web-based tools for computational enzyme design. Curr. Opin. Struct. Biol. 2021; 69:19–34. - PubMed

Publication types

LinkOut - more resources