Nucleic Acids Res. 2021 Jan 8;49(D1):D452-D457.
doi: 10.1093/nar/gkaa1097.

RepeatsDB in 2021: improved data and extended classification for protein tandem repeat structures

Lisanna Paladin et al. Nucleic Acids Res. 2021.

Abstract

The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increases the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and introduces an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification, which combines top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) that require sequence similarity and describe repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.
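
The five-level hierarchy described in the abstract can be pictured as a path through a tree. The following is a minimal sketch in Python, not the RepeatsDB implementation: it assumes that classification identifiers are dot-separated integers, one per level, generalizing the "topology 4.4" notation used in the Figure 1 caption. The `Classification` class, the `LEVELS` tuple and the identifier `4.4.1.2` are hypothetical and serve only as illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Level names in hierarchical order, as listed in the abstract.
LEVELS = ("class", "topology", "fold", "clan", "family")


@dataclass(frozen=True)
class Classification:
    """Hypothetical dotted identifier, e.g. "4.4" = class 4, topology 4."""

    parts: Tuple[int, ...]

    @classmethod
    def parse(cls, identifier: str) -> "Classification":
        # Assumption: one integer per level, separated by dots.
        parts = tuple(int(p) for p in identifier.split("."))
        if not 1 <= len(parts) <= len(LEVELS):
            raise ValueError(f"expected 1 to {len(LEVELS)} levels: {identifier!r}")
        return cls(parts)

    @property
    def level(self) -> str:
        """Name of the deepest level this identifier reaches."""
        return LEVELS[len(self.parts) - 1]

    @property
    def requires_sequence_similarity(self) -> bool:
        # Class, Topology and Fold rely on structural similarity alone;
        # Clan and Family additionally require sequence similarity.
        return len(self.parts) > 3

    def parent(self) -> Optional["Classification"]:
        """Identifier one level up, or None at the Class level."""
        if len(self.parts) == 1:
            return None
        return Classification(self.parts[:-1])

    def __str__(self) -> str:
        return ".".join(str(p) for p in self.parts)


topology = Classification.parse("4.4")   # identifier taken from the Figure 1 caption
clan = Classification.parse("4.4.1.2")   # made-up identifier, for illustration only
print(topology.level, topology.parent(), clan.level)
```

Under these assumptions, walking up with `parent()` moves from the sequence-aware levels (Family, Clan) to the purely structural ones (Fold, Topology, Class), which mirrors the split the abstract draws between the two groups of levels.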

Figures

Figure 1.
RepeatsDB classification. The new levels of the RepeatsDB classification discriminate finer structural and functional differences. RepeatsDB topology 4.4 includes beta-propeller regions. The folds in topology 4.4 are distinguished by the number of units (called ‘blades’ in propellers), while the clans are distinguished by the specific secondary structure content and the relative orientation of the blades, as well as the overall shape of the region.
Figure 2.
(A) RepeatsDB Browse page. This features the classification tree (top) and details of the classification level selected in the tree (bottom). It includes a summary table of the level statistics, an image of a representative structure, a histogram of unit numbers per region and a table including all entries belonging to the selected level. (B) UniProt entry page. This shows details of the entry and the consensus repeat annotation (top), the Feature Viewer with repeat data for all PDB chains mapped to the UniProt entry (center), and the PDB section showing repeat data on the sequence and structure of the selected PDB entry (bottom).
