Bioinformatics. 2019 Feb 1;35(3):518-520.
doi: 10.1093/bioinformatics/bty625.

TreeGrafter: phylogenetic tree-based annotation of proteins with Gene Ontology terms and other annotations

Haiming Tang et al. Bioinformatics. 2019.

Abstract

Summary: TreeGrafter is a new software tool for annotating protein sequences using pre-annotated phylogenetic trees. Currently, the tool provides annotations to Gene Ontology (GO) terms, and PANTHER family and subfamily. The approach is generalizable to any annotations that have been made to internal nodes of a reference phylogenetic tree. TreeGrafter takes each input query protein sequence, finds the best matching homologous family in a library of pre-calculated, pre-annotated gene trees, and then grafts it to the best location in the tree. It then annotates the sequence by propagating annotations from ancestral nodes in the reference tree. We show that TreeGrafter outperforms subfamily HMM scoring for correctly assigning subfamily membership, and that it produces highly specific annotations of GO terms based on annotated reference phylogenetic trees. This method will be further integrated into InterProScan, enabling an even broader user community.

Availability and implementation: TreeGrafter is freely available on the web at https://github.com/pantherdb/TreeGrafter, including as a Docker image.

Supplementary information: Supplementary data are available at Bioinformatics online.

Figures

Fig. 1. TreeGrafter annotates each sequence based on where it is grafted onto an annotated reference tree. Given the same tree with pre-annotated ancestral gene nodes (left panel), each query sequence is grafted onto the tree. For the graft position of query 1 (top, blue open circle) there are two annotated ancestral nodes from which query 1 inherits annotations, while for query 2 (bottom, blue open circle), there is only one annotated ancestral node and only the annotations from this one node are inherited by query 2.
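
The inheritance step described in the caption can be illustrated with a minimal Python sketch. It is not TreeGrafter's implementation: the tree, the node names (`AN1`, `AN2`, `AN3`), the annotation labels, and the function names are all hypothetical. It also assumes the graft position is already known, and that a query simply inherits the union of annotations on every ancestral node between its graft point and the root.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy reference tree, stored as child -> parent.
# None marks the root. Node names are illustrative only.
PARENT: Dict[str, Optional[str]] = {
    "root": None,
    "AN1": "root",
    "AN2": "AN1",
    "AN3": "root",
}

# Hypothetical annotations attached to ancestral nodes.
# The labels are placeholders, not real GO identifiers.
NODE_ANNOTATIONS: Dict[str, Set[str]] = {
    "AN1": {"TERM_A"},
    "AN2": {"TERM_B"},
    "AN3": {"TERM_C"},
}


def ancestors(node: str) -> List[str]:
    """Return the path from the given node up to the root, inclusive."""
    path: List[str] = []
    current: Optional[str] = node
    while current is not None:
        path.append(current)
        current = PARENT[current]
    return path


def inherit_annotations(graft_node: str) -> Set[str]:
    """Collect annotations from every ancestral node above the graft point.

    Assumption: graft_node is the closest ancestral node of the grafted
    query, and plain union is used as the propagation rule.
    """
    inherited: Set[str] = set()
    for node in ancestors(graft_node):
        inherited |= NODE_ANNOTATIONS.get(node, set())
    return inherited


# Query 1 is grafted below two annotated ancestral nodes (AN2 and AN1).
query1 = inherit_annotations("AN2")
# Query 2 is grafted below a single annotated ancestral node (AN3).
query2 = inherit_annotations("AN3")
```

In this toy tree, `query1` picks up the annotations of both `AN2` and `AN1`, while `query2` picks up only those of `AN3`, mirroring the two cases in Fig. 1. Finding the best matching family and the graft position are separate steps that this sketch leaves out.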
