Three-dimensional Structure Databases of Biological Macromolecules
- PMID: 35507259
- DOI: 10.1007/978-1-0716-2095-3_3
Three-dimensional Structure Databases of Biological Macromolecules
Abstract
Databases of three-dimensional structures of proteins (and their associated molecules) provide: (a) Curated repositories of coordinates of experimentally determined structures, including extensive metadata; for instance information about provenance, details about data collection and interpretation, and validation of results. (b) Information-retrieval tools to allow searching to identify entries of interest and provide access to them. (c) Links among databases, especially to databases of amino-acid and genetic sequences, and of protein function; and links to software for analysis of amino-acid sequence and protein structure, and for structure prediction. (d) Collections of predicted three-dimensional structures of proteins. These will become more and more important after the breakthrough in structure prediction achieved by AlphaFold2. The single global archive of experimentally determined biomacromolecular structures is the Protein Data Bank (PDB). It is managed by wwPDB, a consortium of five partner institutions: the Protein Data Bank in Europe (PDBe), the Research Collaboratory for Structural Bioinformatics (RCSB), the Protein Data Bank Japan (PDBj), the BioMagResBank (BMRB), and the Electron Microscopy Data Bank (EMDB). In addition to jointly managing the PDB repository, the individual wwPDB partners offer many tools for analysis of protein and nucleic acid structures and their complexes, including providing computer-graphic representations. Their collective and individual websites serve as hubs of the community of structural biologists, offering newsletters, reports from Task Forces, training courses, and "helpdesks," as well as links to external software.Many specialized projects are based on the information contained in the PDB. Especially important are SCOP, CATH, and ECOD, which present classifications of protein domains.
Keywords: Data archiving; Domain analysis; Fold classification; Protein Data Bank; Protein structure; Structural biology.
© 2022. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.
References
-
- (1971) Crystallography: Protein data bank. Nature New Biol 233:223
-
- (2021) A celebration of structural biology. Nat Methods 18:427
-
- Bordin N, Sillitoe I, Lees JG, Orengo C (2021) Tracing evolution through protein structures: Nature captured in a few thousand folds. Front Mol Biosci 8:668184
-
- Dayhoff MO, Eck RV et al (1965) Atlas of protein sequence and structure. National Biomedical Research Foundation, Silver Spring, MD
-
- Lipscomb WN, Reeke GN Jr, Hartsuck JA, Quiocho FA, Bethge PH (1970) The structure of carboxypeptidase A. 8. Atomic interpretation at 0.2 nm resolution, a new study of the complex of glycyl-L-tyrosine with CPA, and mechanistic deductions. Philos Trans R Soc Lond B257:177–214
MeSH terms
Substances
LinkOut - more resources
Full Text Sources