Nucleic Acids Res. 2014 Jan;42(Database issue):D315-9.
doi: 10.1093/nar/gkt1189. Epub 2013 Nov 21.

ArchDB 2014: structural classification of loops in proteins

Jaume Bonet et al. Nucleic Acids Res. 2014 Jan.

Abstract

The function of a protein is determined by its three-dimensional structure, which is formed by regular elements (i.e. β-strands and α-helices) and non-periodic structural units such as loops. Compared to regular structural elements, non-periodic, non-repetitive conformational units show a much higher degree of variability, which makes regularities harder to identify, and yet they represent an important part of the structure of a protein. Indeed, loops often play a pivotal role in the function of a protein and in different aspects of protein folding and dynamics. Therefore, the structural classification of protein loops is an important subject with clear applications in homology modelling, protein structure prediction, protein design (e.g. enzyme design and catalytic loops) and function prediction. ArchDB, the database presented here (freely available at http://sbi.imim.es/archdb), is such a resource and has been an important asset for the scientific community throughout the years. In this article, we present a completely reworked and updated version of ArchDB. The new version features a novel, fast and user-friendly web-based interface and a novel graph-based, computationally efficient clustering algorithm. The current version of ArchDB classifies 149,134 loops in 5739 classes and 9608 subclasses.

Figures

Figure 1.
Classification pipeline. Two different methods are applied to build the loop clusters (DS and MCL, see Clustering section and Supplementary Material). Shown within brackets in each subclass is the consensus geometry of the clustered loops, i.e. distance, hoist angle, packing angle and meridian angle [see definitions for loop geometry in the supplementary material, FAQs and in (23)].
Figure 2.
Distribution of classified loops for each clustering method as a function of loop length.
Figure 3.
RMSD distribution of the five most populated loop lengths (from 0 to 4) for all loop types. Distribution using DS clustering (top). Distribution using MCL clustering (bottom; this includes two types of subclasses 4S and 4M at length 4). See Supplementary Figures S1 and S2 for a detailed analysis of the RMSD distribution by type-length.
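
The captions above name MCL (Markov clustering) as one of the two methods used to build loop clusters. As a rough illustration of how generic MCL partitions a graph, the sketch below alternates expansion (matrix squaring) and inflation (element-wise powering followed by column normalisation) on a small adjacency matrix. It is not ArchDB's implementation: the function names, the `inflation` value, the convergence tolerance and the toy graph are all assumptions made for illustration, and how ArchDB builds its loop similarity graph is not described here.

```python
def normalize_columns(m):
    """Scale each column so that it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    col_sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / col_sums[j] if col_sums[j] else 0.0 for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain square-matrix product."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mcl(adjacency, inflation=2.0, max_iter=100, tol=1e-9):
    """Hypothetical minimal Markov clustering on a square adjacency matrix.

    Returns a sorted list of clusters, each a sorted list of node indices.
    """
    n = len(adjacency)
    # Add self-loops, then turn the graph into a column-stochastic matrix.
    m = normalize_columns(
        [[float(adjacency[i][j]) + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    )
    for _ in range(max_iter):
        expanded = matmul(m, m)  # expansion: flow spreads along paths
        inflated = normalize_columns(  # inflation: strong flow is reinforced
            [[x ** inflation for x in row] for row in expanded]
        )
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break
    # Each row that still carries flow defines a cluster: the columns it reaches.
    clusters = set()
    for row in m:
        members = frozenset(j for j, x in enumerate(row) if x > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


# Toy graph (assumed for illustration): two disconnected triangles.
toy = [
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
]
print(mcl(toy))  # [[0, 1, 2], [3, 4, 5]]
```

A larger `inflation` value generally yields finer clusters. The dense pure-Python matrices used here only suit very small graphs; a dataset on the scale of ArchDB would call for a sparse implementation.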

