Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009 Jan;37(Database issue):D380-6.
doi: 10.1093/nar/gkn762. Epub 2008 Nov 26.

SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny

Affiliations

SUPERFAMILY--sophisticated comparative genomics, data mining, visualization and phylogeny

Derek Wilson et al. Nucleic Acids Res. 2009 Jan.

Abstract

SUPERFAMILY provides structural, functional and evolutionary information for proteins from all completely sequenced genomes, and large sequence collections such as UniProt. Protein domain assignments for over 900 genomes are included in the database, which can be accessed at http://supfam.org/. Hidden Markov models based on Structural Classification of Proteins (SCOP) domain definitions at the superfamily level are used to provide structural annotation. We recently produced a new model library based on SCOP 1.73. Family level assignments are also available. From the web site users can submit sequences for SCOP domain classification; search for keywords such as superfamilies, families, organism names, models and sequence identifiers; find over- and underrepresented families or superfamilies within a genome relative to other genomes or groups of genomes; compare domain architectures across selections of genomes and finally build multiple sequence alignments between Protein Data Bank (PDB), genomic and custom sequences. Recent extensions to the database include InterPro abstracts and Gene Ontology terms for superfamiles, taxonomic visualization of the distribution of families across the tree of life, searches for functionally similar domain architectures and phylogenetic trees. The database, models and associated scripts are available for download from the ftp site.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
TaxViz displays the distribution of domains across the major taxonomic kingdoms, and organisms within each kingdom. Shown here is the distribution of the P-loop containing nucleoside triphosphate hydrolase domains. Each circle or node, represents the features of a single taxonomic group or individual organism. The nodes are arranged hierarchically in concentric rings. The higher taxonomic groups (superkingdoms: Eukaryota, Bacteria and Archaea), located in the centre, lead recursively outwards towards their children (the kingdoms or phyla within each superkingdom). For taxonomic groups, the size of the node increases logarithmically with the mean number of domains found per organism in the taxonomic group. The distribution of domains in individual species can be navigated to using the outer nodes. There are three specialized nodes which display the distribution of domains in (i) selected model organisms; (ii) organisms containing the maximum number of domains; and (iii) organisms containing the minimum number of domains.
Figure 2.
Figure 2.
Phylogenetic tree example. Shows all sequenced Drosophila species, and Primates plotted on the same tree, using the relationships calculated from all genomes.

References

    1. Mulder NJ, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Buillard V, Cerutti L, Copley R, et al. New developments in the InterPro database. Nucleic Acids Res. 2008;35:D224–D228. - PMC - PubMed
    1. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, et al. Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. - PMC - PubMed
    1. Andreeva A, Howorth D, Chandonia J-M, Brenner SE, Hubbard TJP, Chothia C, Murzin AG. Data growth and its impact on the SCOP database: new developments. Nucleic Acid Res. 2008;36:D419–D425. - PMC - PubMed
    1. Chandonia JM, Hon G, Walker NS, Lo Conte L, Koehl P, Levitt M, Brenner SE. The ASTRAL compendium in 2004. Nucleic Acids Res. 2004;32:D189–D192. - PMC - PubMed
    1. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J. Mol. Biol. 1990;215:403–410. - PubMed

Publication types