Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan 5;52(D1):D536-D544.
doi: 10.1093/nar/gkad947.

PED in 2024: improving the community deposition of structural ensembles for intrinsically disordered proteins

Collaborators, Affiliations

PED in 2024: improving the community deposition of structural ensembles for intrinsically disordered proteins

Hamidreza Ghafouri et al. Nucleic Acids Res. .

Abstract

The Protein Ensemble Database (PED) (URL: https://proteinensemble.org) is the primary resource for depositing structural ensembles of intrinsically disordered proteins. This updated version of PED reflects advancements in the field, denoting a continual expansion with a total of 461 entries and 538 ensembles, including those generated without explicit experimental data through novel machine learning (ML) techniques. With this significant increment in the number of ensembles, a few yet-unprecedented new entries entered the database, including those also determined or refined by electron paramagnetic resonance or circular dichroism data. In addition, PED was enriched with several new features, including a novel deposition service, improved user interface, new database cross-referencing options and integration with the 3D-Beacons network-all representing efforts to improve the FAIRness of the database. Foreseeably, PED will keep growing in size and expanding with new types of ensembles generated by accurate and fast ML-based generative models and coarse-grained simulations. Therefore, among future efforts, priority will be given to further develop the database to be compatible with ensembles modeled at a coarse-grained level.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Experimental techniques and ensemble generation methods. (A) Matrix layout quantifies the combinations of experimental techniques for PED entries, sorted by size. Filled circles in the matrix indicate which experimental measurement is part of the intersection. (B) Distribution of ensemble generation methods and auxiliary software applied in PED. The X axis represents the number of PED entries.
Figure 2.
Figure 2.
PEDdeposition service workflow. The workflow begins with the submission of an ensemble, which includes the description of both experimental and computational components, the deposition of conformers and the specification of the protein construct via UniProt accessions and/or protein sequence. The next step involves running the validation pipeline to evaluate the uploaded structures and generating insightful statistics through tools such as MolProbity, DSSP and calculating the radius of gyration. At the final stage, the submitted ensemble entry undergoes a final review by biocurators who determine whether to accept or reject it. Ultimately, approved ensembles are subsequently published on PED for public access.

References

    1. Tompa P., Fersht A.. Structure and Function of Intrinsically Disordered Proteins. 2009; CRC Press.
    1. Wang H., Xiong R., Lai L.. Rational drug design targeting intrinsically disordered proteins. WIREs Comput. Mol. Sci. 2016; 11:65–77.
    1. Hatos A., Monzon A.M., Tosatto S.C.E., Piovesan D., Fuxreiter M.. FuzDB: a new phase in understanding fuzzy interactions. Nucleic Acids Res. 2021; 50:D509–D517. - PMC - PubMed
    1. Bonomi M., Heller G.T., Camilloni C., Vendruscolo M.. Principles of protein structural ensemble determination. Curr. Opin. Struct. Biol. 2017; 42:106–116. - PubMed
    1. PDBe-KB consortium PDBe-KB: collaboratively defining the biological context of structural data. Nucleic Acids Res. 2022; 50:D534–D542. - PMC - PubMed

MeSH terms

Substances