Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 3;40(6):btae370.
doi: 10.1093/bioinformatics/btae370.

RCSB protein Data Bank: exploring protein 3D similarities via comprehensive structural alignments

Affiliations

RCSB protein Data Bank: exploring protein 3D similarities via comprehensive structural alignments

Sebastian Bittrich et al. Bioinformatics. .

Abstract

Motivation: Tools for pairwise alignments between 3D structures of proteins are of fundamental importance for structural biology and bioinformatics, enabling visual exploration of evolutionary and functional relationships. However, the absence of a user-friendly, browser-based tool for creating alignments and visualizing them at both 1D sequence and 3D structural levels makes this process unnecessarily cumbersome.

Results: We introduce a novel pairwise structure alignment tool (rcsb.org/alignment) that seamlessly integrates into the RCSB Protein Data Bank (RCSB PDB) research-focused RCSB.org web portal. Our tool and its underlying application programming interface (alignment.rcsb.org) empowers users to align several protein chains with a reference structure by providing access to established alignment algorithms (FATCAT, CE, TM-align, or Smith-Waterman 3D). The user-friendly interface simplifies parameter setup and input selection. Within seconds, our tool enables visualization of results in both sequence (1D) and structural (3D) perspectives through the RCSB PDB RCSB.org Sequence Annotations viewer and Mol* 3D viewer, respectively. Users can effortlessly compare structures deposited in the PDB archive alongside more than a million incorporated Computed Structure Models coming from the ModelArchive and AlphaFold DB. Moreover, this tool can be used to align custom structure data by providing a link/URL or uploading atomic coordinate files directly. Importantly, alignment results can be bookmarked and shared with collaborators. By bridging the gap between 1D sequence and 3D structures of proteins, our tool facilitates deeper understanding of complex evolutionary relationships among proteins through comprehensive sequence and structural analyses.

Availability and implementation: The alignment tool is part of the RCSB PDB research-focused RCSB.org web portal and available at rcsb.org/alignment. Programmatic access is available via alignment.rcsb.org. Frontend code has been published at github.com/rcsb/rcsb-pecos-app. Visualization is powered by the open-source Mol* viewer (github.com/molstar/molstar and github.com/molstar/rcsb-molstar) plus the Sequence Annotations in 3D Viewer (github.com/rcsb/rcsb-saguaro-3d).

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
Output of the pairwise alignment application. (A) Summary of aligned chains. (B) 3D Structure-based scores (Root-Mean-Square-Deviation or RMSD in Ångstrom Units; TM-score, range 0.00–1.00). (C) Sequence-based scores (% identity, number of aligned residues). (D) Toggle visibility of aligned chain, all polymers, and all ligands via 1st, 2nd, and 3rd box buttons, respectively. (E) Structurally aligned sequence regions in a darker shade, regions aligned only at sequence level in a lighter shade, non-aligned gaps are left empty. (F) 3D visualization of aligned structures. (G) Tools to export and share alignment.

References

    1. Bittrich S, Bhikadiya C, Bi C. et al. RCSB protein data bank: efficient searching and simultaneous access to one million computed structure models alongside the PDB structures enabled by architectural advances. J Mol Biol 2023;435:167994. - PMC - PubMed
    1. Burley SK, Bhikadiya C, Bi C. et al. RCSB protein data bank: tools for visualizing and understanding biological macromolecules in 3D. Protein Sci 2022;31:e4482. - PMC - PubMed
    1. Burley SK, Bhikadiya C, Bi C. et al. RCSB protein data bank (RCSB. org): delivery of experimentally-determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res 2023;51:D488–508. - PMC - PubMed
    1. Holm L, Laiho A, Törönen P. et al. DALI shines a light on remote homologs: one hundred discoveries. Protein Sci 2023;32:e4519. - PMC - PubMed
    1. Lafita A, Bliven S, Prlić A. et al. BioJava 5: a community driven open-source bioinformatics library. PLoS Comput Biol 2019;15:e1006791. - PMC - PubMed

Publication types