Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 May 4;5(2):12.
doi: 10.3390/epigenomes5020012.

The EpiDiverse Plant Epigenome-Wide Association Studies (EWAS) Pipeline

Affiliations

The EpiDiverse Plant Epigenome-Wide Association Studies (EWAS) Pipeline

Sultan Nilay Can et al. Epigenomes. .

Abstract

Bisulfite sequencing is a widely used technique for determining DNA methylation and its relationship with epigenetics, genetics, and environmental parameters. Various techniques were implemented for epigenome-wide association studies (EWAS) to reveal meaningful associations; however, there are only very few plant studies available to date. Here, we developed the EpiDiverse EWAS pipeline and tested it using two plant datasets, from P. abies (Norway spruce) and Q. lobata (valley oak). Hence, we present an EWAS implementation tested for non-model plant species and describe its use.

Keywords: DNA methylation; EWAS; GWAS; non-model species; pipeline; plant epigenetics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
The EpiDiverse EWAS pipeline workflow and its interaction with the WGBS, SNP, and DMR EpiDiverse pipelines. Utilized packages or software were specified next to pipeline names. The EpiDiverse epigenome-wide association studies (EWAS) pipeline requires a tab-separated sample.tsv file (shown with purple frame) to specify climatic data and covariate(s) for group determination (can be sampling site, geographical location, or treatment group) and methylation data. As methylation input types, it accepts methylation calls (green arrow) and differentially methylated positions/differentially methylated regions (DMPs/DMRs) (yellow arrow), which can be provided by the whole-genome bisulfite sequencing (WGBS) and the DMR pipelines, respectively. The EWAS pipeline allows running three different models to find epigenetic markers associated with the environment (E), genetic variation (G), or the combination of both (GxE). G and GxE models need single nucleotide polymorphism (SNP) information (red arrow), which can be directly calculated by the SNP pipeline using bisulfite sequencing data, or, as for all other inputs, it can be provided by users. See Figures S1–S4 for more detail. * indicates multiple files with the same extension in a specified directory.
Figure 2
Figure 2
Average-over-region method with the DMRs input type. Overview to show (a) when differentially methylated regions (DMRs) arise from multiple pairwise comparisons between groups (e.g., AB, AC, BC) they are intersected to form distinct, nonoverlapping union DMRs (uDMR) according to a minimum fraction of supporting comparisons provided by the user (e.g., X = 0.5 in this sample). These uDMRs can be merged or taken as independent for further analysis. The resulting uDMRs are intersected with the methylated positions in (b) to derive average methylation levels in each sample for each region, which can then be carried forward as unique identifiers for EWAS. When only a single set of DMRs are provided to the pipeline, the regions are simply taken as is for the averaging process. This averaging process is repeated for all samples.
Figure 3
Figure 3
EpiDiverse EWAS pipeline output directory structure. EpiDiverse EWAS pipeline generates input directory as default and positions directory with methylation calls, DMPs, and regions directory DMRs. Input directory covers gxe.txt, snps.txt, cov.txt, and env.txt files and bed directory with merged bedGraph files as unfiltered and filtered and missing data estimated. Both positions and regions directories have three subdirectories for outputs and graphs with Emodel, Gmodel, and GxE names. Q–Q plots and histograms are produced with all models.
Figure 4
Figure 4
Coalescence analysis between the SNP and methylation data for the CG context. SNP (left) and averaged methylation call values (right) cluster comparison for the CG context. This comparison yielded a 72% topological score indicating a relatively high fraction of clades/branches present in both trees (cf. 3. Methods for details). The thick branches represent deviating topologies. Minus refers to ramets and plus to ortets. Cf. Figures S13–S18 for additional analyses.
Figure 5
Figure 5
Upset plot of significant positions for all models with CG context using locations of trees as covariates with precipitation environmental data. Vertical lines refer to shared terms between classes on the left side. A maximum shared number of terms is between G and GxE models with DMP input type. Overall, 39% of the terms are shared and 61% are unique to single inputs. The highest number of unique elements are found for GxE DMR input with 42,438 terms, and the lowest is with two terms for the Emodel CG MP input.
Figure 6
Figure 6
Subset of BP GO terms related to “water”, “root”, “shoot”, and “defense” per input type under three models (G, GxE, and E), in GC context for precipitation. Several BP GO terms matching “water”, “root”, “shoot”, or “defense” are shown per model and input type. Cells are colored from green = high to red = low.

References

    1. Fuchs J., Demidov D., Houben A., Schubert I. Chromosomal histone modification patterns—From conservation to diversity. Trends Plant Sci. 2006;11:199–208. doi: 10.1016/j.tplants.2006.02.008. - DOI - PubMed
    1. Weinhold B. Epigenetics: The Science of Change. Environ. Heal Perspect. 2006;114:A160–A167. doi: 10.1289/ehp.114-a160. - DOI - PMC - PubMed
    1. Sudan J., Raina M., Singh R. Plant epigenetic mechanisms: Role in abiotic stress and their generational heritability. Biotech. 2018;8:172. doi: 10.1007/s13205-018-1202-6. - DOI - PMC - PubMed
    1. Quadrana L., Colot V. Plant Transgenerational Epigenetics. Annu. Rev. Genet. 2016;50:467–491. doi: 10.1146/annurev-genet-120215-035254. - DOI - PubMed
    1. Weigel D., Colot V. Epialleles in plant evolution. Genome Biol. 2012;13:249. doi: 10.1186/gb-2012-13-10-249. - DOI - PMC - PubMed

LinkOut - more resources