PlantTribes2: Tools for comparative gene family analysis in plant genomics
- PMID: 36798801
- PMCID: PMC9928214
- DOI: 10.3389/fpls.2022.1011199
PlantTribes2: Tools for comparative gene family analysis in plant genomics
Abstract
Plant genome-scale resources are being generated at an increasing rate as sequencing technologies continue to improve and raw data costs continue to fall; however, the cost of downstream analyses remains large. This has resulted in a considerable range of genome assembly and annotation qualities across plant genomes due to their varying sizes, complexity, and the technology used for the assembly and annotation. To effectively work across genomes, researchers increasingly rely on comparative genomic approaches that integrate across plant community resources and data types. Such efforts have aided the genome annotation process and yielded novel insights into the evolutionary history of genomes and gene families, including complex non-model organisms. The essential tools to achieve these insights rely on gene family analysis at a genome-scale, but they are not well integrated for rapid analysis of new data, and the learning curve can be steep. Here we present PlantTribes2, a scalable, easily accessible, highly customizable, and broadly applicable gene family analysis framework with multiple entry points including user provided data. It uses objective classifications of annotated protein sequences from existing, high-quality plant genomes for comparative and evolutionary studies. PlantTribes2 can improve transcript models and then sort them, either genome-scale annotations or individual gene coding sequences, into pre-computed orthologous gene family clusters with rich functional annotation information. Then, for gene families of interest, PlantTribes2 performs downstream analyses and customizable visualizations including, (1) multiple sequence alignment, (2) gene family phylogeny, (3) estimation of synonymous and non-synonymous substitution rates among homologous sequences, and (4) inference of large-scale duplication events. We give examples of PlantTribes2 applications in functional genomic studies of economically important plant families, namely transcriptomics in the weedy Orobanchaceae and a core orthogroup analysis (CROG) in Rosaceae. PlantTribes2 is freely available for use within the main public Galaxy instance and can be downloaded from GitHub or Bioconda. Importantly, PlantTribes2 can be readily adapted for use with genomic and transcriptomic data from any kind of organism.
Keywords: CROG analysis; applied agriculture; comparative genomics; galaxy; gene family phylogenetics; genome duplication; modular tools; multiple sequence alignment.
Copyright © 2023 Wafula, Zhang, Von Kuster, Leebens-Mack, Honaas and dePamphilis.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Figures










Similar articles
-
Building a foundation for gene family analysis in Rosaceae genomes with a novel workflow: A case study in Pyrus architecture genes.Front Plant Sci. 2022 Nov 14;13:975942. doi: 10.3389/fpls.2022.975942. eCollection 2022. Front Plant Sci. 2022. PMID: 36452099 Free PMC article.
-
zDB: bacterial comparative genomics made easy.mSystems. 2024 Jul 23;9(7):e0047324. doi: 10.1128/msystems.00473-24. Epub 2024 Jun 28. mSystems. 2024. PMID: 38940522 Free PMC article.
-
Large-Scale Sequencing: The Future of Genomic Sciences? This report is based on a colloquium, sponsored by the American Academy of Microbiology, convened September 2008 in Washington, DC.Washington (DC): American Society for Microbiology; 2009. Washington (DC): American Society for Microbiology; 2009. PMID: 33119235 Free Books & Documents. Review.
-
Comparative assembly hubs: web-accessible browsers for comparative genomics.Bioinformatics. 2014 Dec 1;30(23):3293-301. doi: 10.1093/bioinformatics/btu534. Epub 2014 Aug 18. Bioinformatics. 2014. PMID: 25138168 Free PMC article.
-
Placing human gene families into their evolutionary context.Hum Genomics. 2022 Nov 11;16(1):56. doi: 10.1186/s40246-022-00429-5. Hum Genomics. 2022. PMID: 36369063 Free PMC article. Review.
Cited by
-
Unravelling the temporal dynamics of community functions in protists induced by treated wastewater exposure using metatranscriptomics.Sci Rep. 2025 Jul 4;15(1):23957. doi: 10.1038/s41598-025-10083-1. Sci Rep. 2025. PMID: 40615598 Free PMC article.
-
kakapo: easy extraction and annotation of genes from raw RNA-seq reads.PeerJ. 2023 Nov 27;11:e16456. doi: 10.7717/peerj.16456. eCollection 2023. PeerJ. 2023. PMID: 38034874 Free PMC article.
-
Transcriptomic approach to uncover dynamic events in the development of mid-season sunburn in apple fruit.G3 (Bethesda). 2023 Aug 9;13(8):jkad120. doi: 10.1093/g3journal/jkad120. G3 (Bethesda). 2023. PMID: 37259608 Free PMC article.
-
Peach LAZY1 and DRO1 protein-protein interactions and co-expression with PRAF/RLD family support conserved gravity-related protein interactions across plants.MicroPubl Biol. 2024 Jan 10;2024:10.17912/micropub.biology.000995. doi: 10.17912/micropub.biology.000995. eCollection 2024. MicroPubl Biol. 2024. PMID: 38287925 Free PMC article.
-
Evaluating the stability of nursery-established arbuscular mycorrhizal fungal associations in apple rootstocks.Appl Environ Microbiol. 2025 Jan 31;91(1):e0193724. doi: 10.1128/aem.01937-24. Epub 2024 Dec 10. Appl Environ Microbiol. 2025. PMID: 39655940 Free PMC article.
References
LinkOut - more resources
Full Text Sources
Other Literature Sources