G2GSnake: a Snakemake workflow for host-pathogen genomic association studies
- PMID: 37840906
- PMCID: PMC10576169
- DOI: 10.1093/bioadv/vbad142
Abstract
Summary: Joint analyses of paired host and pathogen genome sequences have the potential to enhance our understanding of host-pathogen interactions. A systematic way to conduct such a joint analysis is a "genome-to-genome" (G2G) association study, which tests for associations between all host and pathogen genetic variants. Significant associations reveal host genetic factors that might drive pathogen variation, highlighting biological mechanisms likely to be involved in host control and pathogen escape. Here, we present a Snakemake workflow that allows researchers to conduct G2G studies in a reproducible and scalable manner. In addition, we have developed an intuitive R Shiny application that generates custom summaries of the results, enabling users to derive relevant insights.
Availability and implementation: G2GSnake is freely available at: https://github.com/zmx21/G2GSnake under the MIT license.
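To make the idea of testing all host variants against all pathogen variants concrete, the following is a minimal sketch of a G2G scan on toy data. It is not the G2GSnake implementation: the function names, the binary 0/1 encoding, the choice of Fisher's exact test, and the Bonferroni threshold are all assumptions made for illustration, and a real study would also need to adjust for covariates such as population structure.

```python
from itertools import product
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of observing x in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    tail = sum(p for p in map(prob, range(lo, hi + 1)) if p <= p_obs * (1 + 1e-9))
    return min(1.0, tail)


def g2g_scan(host, pathogen, alpha=0.05):
    """Test every host variant against every pathogen variant (hypothetical helper).

    host and pathogen map a variant name to a list of 0/1 values,
    one per sample, with samples in the same order in both inputs.
    """
    threshold = alpha / (len(host) * len(pathogen))  # Bonferroni correction
    results = []
    for (h_name, h), (p_name, p) in product(host.items(), pathogen.items()):
        pairs = list(zip(h, p))
        a = pairs.count((1, 1))  # host variant present, pathogen variant present
        b = pairs.count((1, 0))
        c = pairs.count((0, 1))
        d = pairs.count((0, 0))
        p_value = fisher_exact_two_sided(a, b, c, d)
        results.append((h_name, p_name, p_value, p_value < threshold))
    # Strongest associations first.
    return sorted(results, key=lambda r: r[2])


# Toy data (invented for illustration): 12 paired samples, 2 variants per genome.
toy_host = {
    "host_snp_1": [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    "host_snp_2": [1, 0, 1, 0, 1, 0, 1, 0, 1, 0, 1, 0],
}
toy_pathogen = {
    "path_var_A": [1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0],
    "path_var_B": [0, 1, 1, 0, 0, 1, 1, 0, 0, 1, 1, 0],
}

for h_name, p_name, p_value, significant in g2g_scan(toy_host, toy_pathogen):
    print(f"{h_name}\t{p_name}\tp={p_value:.4g}\tsignificant={significant}")
```

Because the number of tests grows as the product of the host and pathogen variant counts, the multiple-testing threshold becomes stringent quickly, which is one reason a scalable workflow is useful for this kind of analysis.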
© The Author(s) 2023. Published by Oxford University Press.
Conflict of interest statement
O.N. is now an employee of SUN bioscience SA.