Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jan;35(Database issue):D308-13.
doi: 10.1093/nar/gkl910. Epub 2006 Nov 10.

The SUPERFAMILY database in 2007: families and functions

Affiliations

The SUPERFAMILY database in 2007: families and functions

Derek Wilson et al. Nucleic Acids Res. 2007 Jan.

Abstract

The SUPERFAMILY database provides protein domain assignments, at the SCOP 'superfamily' level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families which have a common evolutionary ancestor based on structural, functional and sequence data. SUPERFAMILY domain assignments are generated using an expert curated set of profile hidden Markov models. All models and structural assignments are available for browsing and download from http://supfam.org. The web interface includes services such as domain architectures and alignment details for all protein assignments, searchable domain combinations, domain occurrence network visualization, detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, assignment of manually submitted sequences and keyword searches. In this update we describe the SUPERFAMILY database and outline two major developments: (i) incorporation of family level assignments and (ii) a superfamily-level functional annotation. The SUPERFAMILY database can be used for general protein evolution and superfamily-specific studies, genomic annotation, and structural genomics target suggestion and assessment.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Summary of the functionality and results that are available as part of the SUPERFAMILY analysis framework via the web interface.
Figure 2
Figure 2
Domain architecture and assignment details for the Ensembl protein ENSP00000315147 from human. Shown are the superfamily and family classification and associated E-values for two domains. Links to further family details, alignments between the SUPERFAMILY model and the protein, assignments for the human genome and domain combinations in which the superfamily domain occur in are included for each domain.

References

    1. Madera M., Vogel C., Kummerfeld S.K., Chothia C., Gough J. The superfamily database in 2004: additions and improvements. Nucleic Acids Res. 2004;32:D235–D239. - PMC - PubMed
    1. Andreeva A., Howorth D., Brenner S.E., Hubbard T.J., Chothia C., Murzin A.G. Scop database in 2004: refinements integrate structure and sequence family data. Nucleic Acids Res. 2004;32:D226–D229. - PMC - PubMed
    1. Wu C.H., Apweiler R., Bairoch A., Natale D.A., Barker W.C., Boeckmann B., Ferro S., Gasteiger E., Huang H., Lopez R., et al. The universal protein resource (uniprot): an expanding universe of protein information. Nucleic Acids Res. 2006;34:D187–D191. - PMC - PubMed
    1. Deshpande N., Addess K.J., Bluhm W.F., Merino-Ott J.C., Townsend-Merino W., Zhang Q., Knezevich C., Xie L., Chen L., Feng Z., et al. The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema. Nucleic Acids Res. 2005;33:D233–D237. - PMC - PubMed
    1. Gough J., Karplus K., Hughey R., Chothia C. Assignment of homology to genome sequences using a library of hidden markov models that represent all proteins of known structure. J. Mol. Biol. 2001;313:903–919. - PubMed

Publication types