Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2009:563:123-40.
doi: 10.1007/978-1-60761-175-2_7.

PANTHER pathway: an ontology-based pathway database coupled with data analysis tools

Affiliations

PANTHER pathway: an ontology-based pathway database coupled with data analysis tools

Huaiyu Mi et al. Methods Mol Biol. 2009.

Abstract

The availability of whole genome sequences from various model organisms and increasing experimental data and literatures stimulated the evolution of a systems approach for biological research. The development of computational tools and algorithms to study biological pathway networks has made great progress in helping analyze research data. Pathway databases become an integral part of such an approach. This chapter first discusses how biological knowledge is represented, particularly the importance of ontologies or standards in systems biology research. Next, we use PANTHER Pathway as an example to illustrate how ontologies and standards play a role in data modeling, data entry, and data display. Last, we describe the usage of such systems. We also describe the computational tools that utilize PANTHER Pathway information to analyze gene expression experimental data.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Two views of pathway reactions
(A) and (C). Reaction diagrams that display activity flows of two activation reactions that are commonly used in scientific papers. (B) and (D). Diagrams drawn in CellDesigner, which captures the different molecular mechanisms of the activation reactions illustrated in (A) and (C), respectively.
Figure 2
Figure 2. Phylogenetic tree of a PANTHER protein family
This figure shows an example of a PANTHER protein family PTHR1537, a voltage-gated potassium channel family. The phylogenetic tree was constructed computationally from sequence data. (A) The tree is collapsed to show the subfamilies as leafnodes (blue diamond nodes). Subfamily nodes correspond to common ancestors of extant family members, and are annotated by expert biologists with their inferred molecular functions and roles in biological processes and pathways, based on experiments performed on extant proteins. Each subfamily is represented by a hidden Markov model (HMM) to allow classification of newly discovered protein sequences. (B) A subset of the tree (under red arrow in (A)) is expanded to show an individual training sequence that is used to build a family multisequence alignment, as well as family and subfamily HMMs.
Figure 2
Figure 2. Phylogenetic tree of a PANTHER protein family
This figure shows an example of a PANTHER protein family PTHR1537, a voltage-gated potassium channel family. The phylogenetic tree was constructed computationally from sequence data. (A) The tree is collapsed to show the subfamilies as leafnodes (blue diamond nodes). Subfamily nodes correspond to common ancestors of extant family members, and are annotated by expert biologists with their inferred molecular functions and roles in biological processes and pathways, based on experiments performed on extant proteins. Each subfamily is represented by a hidden Markov model (HMM) to allow classification of newly discovered protein sequences. (B) A subset of the tree (under red arrow in (A)) is expanded to show an individual training sequence that is used to build a family multisequence alignment, as well as family and subfamily HMMs.
Figure 3
Figure 3. Associating pathway molecule classes to subfamilies in a phylogenetic tree.
(A) A molecule class of phospholipase C (PLC, in yellow) in the Angiotensin II-stimulated signaling through G protein and beta-arrestin (PANTHER accession P05911). (A) The phylogenetic tree of phospholipase C subfamilies (PANTHER accession PTHR10336:SF10 -- SF12). A web curation interface is built to allow expert biologists to associate protein training sequences of the tree to the molecule class. For each annotation, a confidence code must be selected from the list defined by the Gene Ontology Consortium and a PubMed identifier of the source of the literature references as evidence.
Figure 3
Figure 3. Associating pathway molecule classes to subfamilies in a phylogenetic tree.
(A) A molecule class of phospholipase C (PLC, in yellow) in the Angiotensin II-stimulated signaling through G protein and beta-arrestin (PANTHER accession P05911). (A) The phylogenetic tree of phospholipase C subfamilies (PANTHER accession PTHR10336:SF10 -- SF12). A web curation interface is built to allow expert biologists to associate protein training sequences of the tree to the molecule class. For each annotation, a confidence code must be selected from the list defined by the Gene Ontology Consortium and a PubMed identifier of the source of the literature references as evidence.
Figure 4
Figure 4. Gene expression data from Compared gene lists tool viewed on PANTHER website.
(A) The results from Compare gene lists tool. Two sample lists were uploaded to the tool, List_A and List_B. The NCBI human gene list was used as the reference list. (B) The results in the Angiogenesis pathway show molecule classes only present in List_A (red), List_B (green), or both (yellow).
Figure 4
Figure 4. Gene expression data from Compared gene lists tool viewed on PANTHER website.
(A) The results from Compare gene lists tool. Two sample lists were uploaded to the tool, List_A and List_B. The NCBI human gene list was used as the reference list. (B) The results in the Angiogenesis pathway show molecule classes only present in List_A (red), List_B (green), or both (yellow).
Figure 5
Figure 5. Gene expression data from Analyze a list of genes with gene expression values tool viewed on PANTHER website.
(A) The output of the tool with a list of P-values for each comparison between a functional category distribution and the reference distribution. (B) Comparison of the distributions from T-cell activation signaling pathway (red) and reference (blue) in graph view. (C) A pathway diagram of T-cell activation signaling pathway that is visualized using an interactive pathway JAVA applet that colors the pathway using a “heat map” derived from the input values.

References

    1. Ashburner M, Mungall CJ, Lewis SE. Ontologies for biologists: a community model for the annotation of genomic data. Cold Spring Harb Symp Quant Biol 2003; 68:227–35. - PubMed
    1. Cho RJ, Campbell MJ. Transcription, genomes, function. Trends Genet 2000; 16 (9):409–15. - PubMed
    1. Clark AG, Glanowski S, Nielsen R et al. Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science 2003; 302 (5652):1960–3. - PubMed
    1. FlyBase Consortium. The FlyBase database of the Drosophila genome projects and community literature. Nucleic Acids Res 2003; 31 (1):172–5. - PMC - PubMed
    1. Fukuda KI, Yamagata Y, Takagi T. FREX: a query interface for biological processes with hierarchical and recursive structures. In Silico Biol 2004; 4 (1):63–79. - PubMed

LinkOut - more resources