Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Jan;39(Database issue):D420-6.
doi: 10.1093/nar/gkq1001. Epub 2010 Nov 19.

Extending CATH: increasing coverage of the protein structure universe and linking structure with function

Affiliations

Extending CATH: increasing coverage of the protein structure universe and linking structure with function

Alison L Cuff et al. Nucleic Acids Res. 2011 Jan.

Abstract

CATH version 3.3 (class, architecture, topology, homology) contains 128,688 domains, 2386 homologous superfamilies and 1233 fold groups, and reflects a major focus on classifying structural genomics (SG) structures and transmembrane proteins, both of which are likely to add structural novelty to the database and therefore increase the coverage of protein fold space within CATH. For CATH version 3.4 we have significantly improved the presentation of sequence information and associated functional information for CATH superfamilies. The CATH superfamily pages now reflect both the functional and structural diversity within the superfamily and include structural alignments of close and distant relatives within the superfamily, annotated with functional information and details of conserved residues. A significantly more efficient search function for CATH has been established by implementing the search server Solr (http://lucene.apache.org/solr/). The CATH v3.4 webpages have been built using the Catalyst web framework.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
‘CATHerine wheels’. Segments are coloured according to class, namely pink (mainly α), yellow (mainly β), green (αβ) and brown (little secondary structure). The size of each of the segments represents the proportion of structures within any given architecture (inner circle) or fold group (outer circle). (a) The distribution of all non-homologous structures (2386) within CATH v3.3. Superfolds are represented as MOLSCRIPTS adjacent to the wheel. (b) The distribution of the 223 new non-homologous structures in CATH v3.3 (when compared with CATH v3.2).
Figure 2.
Figure 2.
Snapshot of superfamily page for CATH v3.4. Keywords giving information on the functions associated with the superfamily are listed at the top of the page. The smallest and largest domain in the family are displayed to highlight the structural diversity within the family. Pie charts showing the distribution of unique functional terms obtained from Gene3D (FunCAT, KEGG pathways and GO terms) are also displayed; selecting one of these pie charts will take a user to a new functional annotation page giving more information (for example, http://beta.cathdb.info/cathnode/3.40.50.720/function).
Figure 3.
Figure 3.
Correlation between the degree of structural diversity across a superfamily, measured by the number of close structural clusters and population of the superfamily, in terms of number of sequence clusters (at 30% identity) in the genomes.
Figure 4.
Figure 4.
Superfamily comparison plot. Interactive plot which allows the user to compare the structural and functional features of all the superfamilies in CATH though the selection of pull-down menus. The plot displayed here is showing the number of domains for each superfamily against sequence diversity (see http://beta.cathdb.info/cathnode/3.40.50.720/statistics for interactive plot).
Figure 5.
Figure 5.
Snapshot of the multiple structural alignment viewer to be released as part of CATH v3.4. Catalytic and ligand binding residues are retrieved from Wssas and annotated in the alignment. 3D images of superimposed or single domain structures are displayed and annotated in the same way as the multiple structural alignment.

Similar articles

Cited by

References

    1. Orengo CA, Michie AD, Jones S, Jones DT, Swindells MB, Thornton JM. CATH – a hierarchic classification of protein domain structures. Structure. 1997;5:1093–1109. - PubMed
    1. Cuff AL, Sillitoe I, Lewis T, Redfern OC, Garratt R, Thornton J, Orengo CA. The CATH classification revisited—architectures reviewed and new ways to characterize structural divergence in superfamilies. Nucleic Acids Res. 2009;37:D310–D314. - PMC - PubMed
    1. Grabowski M, Joachimiak A, Otwinowski Z, Wladek W. Structural genomics: keeping up with expanding knowledge of the protein universe. Curr. Opin. Struct. Biol. 2007;17:347–353. - PMC - PubMed
    1. Yeats C, Lees J, Reid A, Kellam P, Martin N, Liu X, Orengo C. Gene3D: comprehensive structural and functional annotation of genomes. Nucleic Acids Res. 2008;36:D414–D418. - PMC - PubMed
    1. Greene LH, Lewis TE, Addou S, Cuff A, Dallman T, Dibley M, Redfern O, Pearl F, Nambudiry R, Reid A, et al. The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res. 2007;35:D291–D297. - PMC - PubMed

Publication types