Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2003 Jan 1;31(1):334-41.
doi: 10.1093/nar/gkg115.

PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification

Affiliations

PANTHER: a browsable database of gene products organized by biological function, using curated protein family and subfamily classification

Paul D Thomas et al. Nucleic Acids Res. .

Erratum in

  • Nucleic Acids Res. 2003 Apr 1;31(7):2024

Abstract

The PANTHER database was designed for high-throughput analysis of protein sequences. One of the key features is a simplified ontology of protein function, which allows browsing of the database by biological functions. Biologist curators have associated the ontology terms with groups of protein sequences rather than individual sequences. Statistical models (Hidden Markov Models, or HMMs) are built from each of these groups. The advantage of this approach is that new sequences can be automatically classified as they become available. To ensure accurate functional classification, HMMs are constructed not only for families, but also for functionally distinct subfamilies. Multiple sequence alignments and phylogenetic trees, including curator-assigned information, are available for each family. The current version of the PANTHER database includes training sequences from all organisms in the GenBank non-redundant protein database, and the HMMs have been used to classify gene products across the entire genomes of human, and Drosophila melanogaster. The ontology terms and protein families and subfamilies, as well as Drosophila gene c;assifications, can be browsed and searched for free. Due to outstanding contractual obligations, access to human gene classifications and to protein family trees and multiple sequence alignments will temporarily require a nominal registration fee. PANTHER is publicly available on the web at http://panther.celera.com.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Browsing the PANTHER database by biological functions. (A) Selection of biological processes under lipid, fatty acid and steroid metabolism (note that categories can be independently selected/deselected, so, for example, steroid metabolism has been deselected). (B) Retrieval of protein families and subfamilies assigned by curators to the selected functional categories. (C) Retrieval of a list of human genes encoding proteins that match the selected family and subfamily HMMs.
Figure 1
Figure 1
Browsing the PANTHER database by biological functions. (A) Selection of biological processes under lipid, fatty acid and steroid metabolism (note that categories can be independently selected/deselected, so, for example, steroid metabolism has been deselected). (B) Retrieval of protein families and subfamilies assigned by curators to the selected functional categories. (C) Retrieval of a list of human genes encoding proteins that match the selected family and subfamily HMMs.
Figure 1
Figure 1
Browsing the PANTHER database by biological functions. (A) Selection of biological processes under lipid, fatty acid and steroid metabolism (note that categories can be independently selected/deselected, so, for example, steroid metabolism has been deselected). (B) Retrieval of protein families and subfamilies assigned by curators to the selected functional categories. (C) Retrieval of a list of human genes encoding proteins that match the selected family and subfamily HMMs.
Figure 2
Figure 2
The PANTHER multiple sequence alignment view, highlighting globally conserved positions (black and gray), and subfamily-specific conservation patterns that may indicate residues important for functional specificity (red). Pfam domains are shown as blue bars, one for each subfamily.
Figure 3
Figure 3
The PANTHER tree-attribute view for verifying curation. (A) The ‘collapsed view’, showing the curator-defined subfamilies and ontology associations. (B) The ‘expanded view’, showing all of the constituent sequences and their annotations.
Figure 3
Figure 3
The PANTHER tree-attribute view for verifying curation. (A) The ‘collapsed view’, showing the curator-defined subfamilies and ontology associations. (B) The ‘expanded view’, showing all of the constituent sequences and their annotations.
Figure 4
Figure 4
Examples of PANTHER subfamilies capturing functional divergence. (A) Laminin-related proteins have divergent domain structures (which correlates with divergence within the shared laminin domain), while (B) Secretin-related GPCRs have divergent sequences within a common domain. Both cases can generally be modelled using subfamily HMMs.
Figure 4
Figure 4
Examples of PANTHER subfamilies capturing functional divergence. (A) Laminin-related proteins have divergent domain structures (which correlates with divergence within the shared laminin domain), while (B) Secretin-related GPCRs have divergent sequences within a common domain. Both cases can generally be modelled using subfamily HMMs.

References

    1. Thomas P.D., Campbell,M.J., Kejariwal,A., Mi,H., Karlak,B., Daverman,R., Diemer,K. and Muruganujan,A. PANTHER: a library of protein families and subfamilies indexed by function, submitted. - PMC - PubMed
    1. Venter J.C., Adams,M.D., Myers,E.W., Li,P.W., Mural,R.J. et al. (2001) The sequence of the human genome. Science, 291, 1304–1351. - PubMed
    1. Mi H., Vandergriff,J., Campbell,M., Narechania,A., Lewis,S., Thomas,P.D. and Ashburner,M. Assessment of genome-wide protein function classification for Drosophila melanogaster, submitted. - PMC - PubMed
    1. Sonnhammer E.L., Eddy,S.R. and Durbin,R. (1997) Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins, 28, 405–420. - PubMed
    1. Schultz J., Milpetz,F., Bork,P. and Ponting,C.P. (1998) SMART, a simple modular architecture research tool: identification of signaling domains. Proc. Natl Acad. Sci. USA, 95, 5857–5864. - PMC - PubMed