Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jan;132(1):17008.
doi: 10.1289/EHP12013. Epub 2024 Jan 31.

The Association between Long-Term DDT or DDE Exposures and an Altered Sperm Epigenome-a Cross-Sectional Study of Greenlandic Inuit and South African VhaVenda Men

Affiliations

The Association between Long-Term DDT or DDE Exposures and an Altered Sperm Epigenome-a Cross-Sectional Study of Greenlandic Inuit and South African VhaVenda Men

Ariane Lismer et al. Environ Health Perspect. 2024 Jan.

Abstract

Background: The organochlorine dichlorodiphenyltrichloroethane (DDT) is banned worldwide owing to its negative health effects. It is exceptionally used as an insecticide for malaria control. Exposure occurs in regions where DDT is applied, as well as in the Arctic, where its endocrine disrupting metabolite, p,p'-dichlorodiphenyldichloroethylene (p,p'-DDE) accumulates in marine mammals and fish. DDT and p,p'-DDE exposures are linked to birth defects, infertility, cancer, and neurodevelopmental delays. Of particular concern is the potential of DDT use to impact the health of generations to come via the heritable sperm epigenome.

Objectives: The objective of this study was to assess the sperm epigenome in relation to p,p'-DDE serum levels between geographically diverse populations.

Methods: In the Limpopo Province of South Africa, we recruited 247 VhaVenda South African men and selected 50 paired blood serum and semen samples, and 47 Greenlandic Inuit blood and semen paired samples were selected from a total of 193 samples from the biobank of the INUENDO cohort, an EU Fifth Framework Programme Research and Development project. Sample selection was based on obtaining a range of p,p'-DDE serum levels (mean=870.734±134.030 ng/mL). We assessed the sperm epigenome in relation to serum p,p'-DDE levels using MethylC-Capture-sequencing (MCC-seq) and chromatin immunoprecipitation followed by sequencing (ChIP-seq). We identified genomic regions with altered DNA methylation (DNAme) and differential enrichment of histone H3 lysine 4 trimethylation (H3K4me3) in sperm.

Results: Differences in DNAme and H3K4me3 enrichment were identified at transposable elements and regulatory regions involved in fertility, disease, development, and neurofunction. A subset of regions with sperm DNAme and H3K4me3 that differed between exposure groups was predicted to persist in the preimplantation embryo and to be associated with embryonic gene expression.

Discussion: These findings suggest that DDT and p,p'-DDE exposure impacts the sperm epigenome in a dose-response-like manner and may negatively impact the health of future generations through epigenetic mechanisms. Confounding factors, such as other environmental exposures, genetic diversity, and selection bias, cannot be ruled out. https://doi.org/10.1289/EHP12013.

PubMed Disclaimer

Figures

Figure 1A is a set of two horizontal bar graphs, plotting DNA methylation loss differentially methylated CpGs, DNA methylation gain differentially methylated CpGs, and all differentially methylated CpGs (y-axis) across Greenlandic population DMC distribution, ranging from 0 to 25,000 in increments of 5,000 and South African population differentially methylated CpGs distribution, ranging from 0 to 35,000 in increments of 5,000 (x-axis) for Dynamic CpG sites. Figure 1B is a scatter plot, plotting beta value–South Africa population, ranging from 0 to 1 in unit increments (y-axis) across beta value-Greenland population, ranging from negative 3 to 2 in unit increments (x-axis) for DNA methylation loss and DNA methylation gain. A Venn diagram has two circles. The circle on the left is labeled Greenland differentially methylated CpGs. It displays the following information: 20,705. The circle on the right is labeled South Africa differentially methylated CpGs. It displays the following information: 31,226. The intersection area is labeled 3,472. Figure 1C is a set of four ribbon plus line graphs titled Greenland DNA methylation gain differentially methylated CpGs, Greenland DNA methylation loss differentially methylated CpGs, South Africa DNA methylation gain differentially methylated CpGs, and South Africa DNA methylation loss differentially methylated CpGs, plotting percentage DNA methylation, ranging from 20 to 70 in increments of 10 (y-axis) across log dose (dichlorodiphenyldichloroethylene nanogram per gram lipids), ranging from 1 to 4 in unit increments, from 1 to 4 in unit increments, 1 to 5 in increments of 2, and 1 to 5 in increments of 2 (x-axis), respectively. Figure 1D is a set of three graphs. On the top, the graph, plotting South African population differentially methylated CpGs, Greenland differentially methylated CpGs, Dynamic CpGs, Capture CpGs (y-axis) across ADARB2 (x-axis). At the bottom, the two graphs is a set of bar and line graphs, plotting South African percentage DNA methylation, ranging from 0 to 100 in increments of 25 and Greenlandic percentage DNA methylation, ranging from 0 to 100 in increments of 25 (y-axis) across ADARB2 (x-axis). Figure 1E is a set of two Manhattan plots titled Greenlandic population epigenomic “hotspots” and South African population epigenomic “hotspots,” plotting differentially methylated CpGs density (percentage) and DMC density (percentage), ranging from 20 to 0 in decrements of 10 and 0 to 40 in increments of 10 (y-axis) across chromosome, ranging from 1 to 22 in unit increments (x-axis) for DNA methylation gain and DNA methylation loss, respectively.
Figure 1.
Association between p,p-dichlorodiphenyldichloroethylene (p,p-DDE) serum levels and human sperm DNA methylation (DNAme). (A) Number of differentially methylated CpGs (DMCs), DNA methylation (DNAme) gain DMCs, and DNAme loss DMCs, in the sperm of DDT-exposed Greenland or South African men. Number of intermediate DMCs (DNAme between 20% and 80%) are indicated by grids and percentages on the bar graphs. See Excel Tables S1 and S2. (B) Scatter plot of overlapping DMCs in Greenland and South African sperm (3,472 overlapping DMCs). Orange dots correspond to DMCs that gain DNAme in both Greenland and South African sperm (2,180 DMCs; 62% of total DMCs). Blue dots correspond to DMCs that lose DNAme in both Greenland and South African sperm (422 DMCs; 12.1% of total DMCs). (C) Average percentage DNAme at DNAme gain or loss DMCs in Greenland or South African sperm relative to log10 serum p,p-dichlorodiphenyldichloroethylene (p,p-DDE) concentration (in ng/mL) for each individual. Linear regression line is plotted in dashed black, and confidence interval is shaded in light gray. See Excel Tables S1 and S2. (D) Tracks at the ADARB2 locus showing percentage DNAme levels in Greenland and South African sperm categorized based on low or high serum p,p-DDE levels (Greenland: low in light blue <350 ng/mL, n=17 and high in dark blue >900 ng/mL, n=18; South Africa: low in yellow <1,200 ng/mL, n=19 and high in brown >14,000 ng/mL, n=16). All CpGs captured by MethylC-Capture-sequencing (MCC-seq) are represented in black, and intermediate CpGs are in gray. CpGs captured in the Greenland sperm data set are in blue, and CpGs captured in the South African sperm data set are in red. (E) Manhattan plots on hotspot analysis for Greenland (blue) or South African (orange) sperm. Cluster analysis was performed by calculating the ratio of DMCs with DNAme gain or loss over the total number of CpGs found within 1 Mb sliding windows over the genome; densities >10% (termed clusters) were extracted for further analysis. See Tables S5 and S6. Note: DDT, dichlorodiphenyltrichloroethane; p,p-DDE, p,p-dichlorodiphenyldichloroethylene.
Figure 2A is a bar graph titled p,p′-dichlorodiphenyldichloroethylene altered merged differentially methylated CpGs, plotting number, ranging from 0 to 40,000 in increments of 10,000 (y-axis) across differentially methylated CpGs, merged differentially methylated CpGs, and single CpGs (x-axis) for Greenland and South Africa. Figure 2B is a set of two pie charts. On the left, the pie chart displays the following information: 6,787 Greenland merged differentially methylated CpGs and 1,187 overlap to South Africa merged differentially methylated CpGs. On the right, the pie chart displays the following information: 12,849 South Africa merged differentially methylated CpGs and 1,189 overlap to Greenland merged differentially methylated CpGs. Figure 2C is a set of two pie charts. On the left, the pie chart titled Greenland merged differentially methylated CpGs displays the following information: 4,928 under DNA methylation gain and 1,859 under DNA methylation loss. On the right, the pie chart is titled South Africa merged differentially methylated CpGs displays the following information: 8,259 under DNA methylation gain and 4,590 under DNA methylation loss. Figure 2D is a set of two clustered bar graphs, plotting proportion of merged differentially methylated CpGs (in percentage), ranging from 0 to 30 in increments of 10 and 0 to 40 in increments of 10 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis). Figures 2E to 2H are scatter dot plots titled Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs under Genic, plotting muscle structure development, neurogenesis, positive regulation of GTPase activity, negative regulation of phosphatase activity, regulation of postsynaptic neurotransmitter receptor, axon guidance, homophilic cell adhesion via plasma membrane; peptidyl-lysine methylation, histone H3K4 methylation, signal transduction, regulation of synapse assembly, embryonic skeletal system development, axon guidance, homophilic cell adhesion via plasma membrane; germ cell development, regulation of small GTPase mediated signaling, heart morphogenesis, nervous system development, axon guidance, modulation of chemical synaptic transmission, homophilic cell adhesion via plasma membrane; and sensory perception of sound, receptor internalization, adherens junction organization, regulation of NMDA receptor activity, extracellular matrix organization, negative chemotaxis, and positive regulation of synapse assembly (y-axis) across negative log to the base 2 (weight Fisher), ranging from 10 to 40 in increments of 10; 8 to 12 in increments of 2; 10 to 40 in increments of 10; and 10 to 14 in unit increments (y-axis) for significant gene number, respectively. Figure 2I is a horizontal clustered bar graph titled Transposable elements and persistent regions, plotting simple repeats, L T R-E R V L, L T R-E R V K, L T R, DNA TcMar-Mariner, L T R-E R V 1, P G C escapees, high DNA methylation sperm-to-zygote persistent regions, dyn DNA methylation sperm-to-I C M persistent regions, dyn DNA methylation sperm-to-zygote persistent regions (y-axis) across lowercase z score, ranging from 0 to 80 in increments of 20 (x-axis) for Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs. Figure 2J is a heatmap titled L T R-E R V 1, plotting South Africa DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs, and Greenland DNA methylation loss merged differentially methylated CpGs (y-axis) across young, mid-young, mid-old, and old (x-axis). A scale depicts the lowercase z score ranges from negative 1 to 1 in increments of 0.5. Figure 2K is a horizontal clustered bar graph titled Putative enhancers, plotting fetal small intestine, fetal thymus, fetal placenta, fetal lung, fetal heart, fetal stomach, fetal leg muscle, hESC enhancers, hESC super-enhancers, fetal brain, and sperm (y-axis) across lowercase z score, ranging from negative 10 to 10 in increments of 5 (x-axis) for Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs.
Figure 2.
Sperm differentially methylated regions association with TEs and regions that retain DNA methylation (DNAme) during embryogenesis. (A) Number of differentially methylated CpGs (DMCs), merged DMCs (mDMCs), and single CpGs in Greenland (dark blue) and South African (light orange) sperm. mDMCs were called by merging DMCs separated by a maximum distance of 500 bp. See Excel Tables S3 and S4. (B) Proportion of Greenlandic sperm mDMCs (6,787) that overlap South African sperm mDMCs (overlap=1,187) and South African sperm mDMCs (overlap=12,849) that overlap Greenlandic sperm mDMCs (overlap=1,189). See Excel Tables S3 and S4. (C) Number of DNAme gain or loss mDMCs in Greenlandic or South African sperm. See Excel Tables S3 and S4. (D) Genomic distribution of mDMCs with DNAme gain or loss in Greenlandic or South African sperm relative to the transcriptional start site (TSS). (E–H) Selected significant pathways from GO analysis on genes at an mDMC with DNAme gain or loss in Greenlandic or South African sperm (weighed Fisher p<0.05). Size of circle corresponds to the number of genes from a significant pathway that overlap an mDMC. See Excel Tables S5–S8. (I) Enrichment of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenlandic (white hashes) or South African (white dots) sperm at identified DNAme embryonic persistent regions [see Figure S3, primordial germ cell (PGC) escapees, and transposable element annotations (RepeatMasker hg19 library 20140131)]. Positive enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from the targeted MCC-seq regions. (J) Distribution of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenlandic (white hashes) or South African (white dots) sperm relative to age quarters of LTR-ERV1 TEs (significantly enriched in the four mDMC categories; Figure 2H). Age of LTR-ERV1 TEs was determined by partitioning the TEs’ percentage divergence scores into quarters, where first quarter = low percentage divergence and young LTR-ERV1; second quarter = mid-low percentage divergence and mid-young LTR-ERV1; third quarter = mid-high percentage divergence and mid-old LTR-ERV1; fourth quarter = high percentage divergence and old LTR-ERV1 (see Figure S3B). (K) Enrichment of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenland (white hashes) or South African (white dots) sperm at putative tissue-specific enhancer classes. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from the targeted MCC-seq regions. Note: dyn, dynamic; GO, Gene Ontology; GTPase, guanosine triphosphatase; hESC, human embryonic stem cell; ICM, inner cell mass; MCC-seq, MethylC-Capture-sequencing; NMDA, NMDA, N-methyl-d-aspartate; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; TEs, transposable elements.
Figure 2A is a bar graph titled p,p′-dichlorodiphenyldichloroethylene altered merged differentially methylated CpGs, plotting number, ranging from 0 to 40,000 in increments of 10,000 (y-axis) across differentially methylated CpGs, merged differentially methylated CpGs, and single CpGs (x-axis) for Greenland and South Africa. Figure 2B is a set of two pie charts. On the left, the pie chart displays the following information: 6,787 Greenland merged differentially methylated CpGs and 1,187 overlap to South Africa merged differentially methylated CpGs. On the right, the pie chart displays the following information: 12,849 South Africa merged differentially methylated CpGs and 1,189 overlap to Greenland merged differentially methylated CpGs. Figure 2C is a set of two pie charts. On the left, the pie chart titled Greenland merged differentially methylated CpGs displays the following information: 4,928 under DNA methylation gain and 1,859 under DNA methylation loss. On the right, the pie chart is titled South Africa merged differentially methylated CpGs displays the following information: 8,259 under DNA methylation gain and 4,590 under DNA methylation loss. Figure 2D is a set of two clustered bar graphs, plotting proportion of merged differentially methylated CpGs (in percentage), ranging from 0 to 30 in increments of 10 and 0 to 40 in increments of 10 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis). Figures 2E to 2H are scatter dot plots titled Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs under Genic, plotting muscle structure development, neurogenesis, positive regulation of GTPase activity, negative regulation of phosphatase activity, regulation of postsynaptic neurotransmitter receptor, axon guidance, homophilic cell adhesion via plasma membrane; peptidyl-lysine methylation, histone H3K4 methylation, signal transduction, regulation of synapse assembly, embryonic skeletal system development, axon guidance, homophilic cell adhesion via plasma membrane; germ cell development, regulation of small GTPase mediated signaling, heart morphogenesis, nervous system development, axon guidance, modulation of chemical synaptic transmission, homophilic cell adhesion via plasma membrane; and sensory perception of sound, receptor internalization, adherens junction organization, regulation of NMDA receptor activity, extracellular matrix organization, negative chemotaxis, and positive regulation of synapse assembly (y-axis) across negative log to the base 2 (weight Fisher), ranging from 10 to 40 in increments of 10; 8 to 12 in increments of 2; 10 to 40 in increments of 10; and 10 to 14 in unit increments (y-axis) for significant gene number, respectively. Figure 2I is a horizontal clustered bar graph titled Transposable elements and persistent regions, plotting simple repeats, L T R-E R V L, L T R-E R V K, L T R, DNA TcMar-Mariner, L T R-E R V 1, P G C escapees, high DNA methylation sperm-to-zygote persistent regions, dyn DNA methylation sperm-to-I C M persistent regions, dyn DNA methylation sperm-to-zygote persistent regions (y-axis) across lowercase z score, ranging from 0 to 80 in increments of 20 (x-axis) for Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs. Figure 2J is a heatmap titled L T R-E R V 1, plotting South Africa DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs, and Greenland DNA methylation loss merged differentially methylated CpGs (y-axis) across young, mid-young, mid-old, and old (x-axis). A scale depicts the lowercase z score ranges from negative 1 to 1 in increments of 0.5. Figure 2K is a horizontal clustered bar graph titled Putative enhancers, plotting fetal small intestine, fetal thymus, fetal placenta, fetal lung, fetal heart, fetal stomach, fetal leg muscle, hESC enhancers, hESC super-enhancers, fetal brain, and sperm (y-axis) across lowercase z score, ranging from negative 10 to 10 in increments of 5 (x-axis) for Greenland DNA methylation gain merged differentially methylated CpGs, Greenland DNA methylation loss merged differentially methylated CpGs, South Africa DNA methylation gain merged differentially methylated CpGs, South Africa DNA methylation loss merged differentially methylated CpGs.
Figure 2.
Sperm differentially methylated regions association with TEs and regions that retain DNA methylation (DNAme) during embryogenesis. (A) Number of differentially methylated CpGs (DMCs), merged DMCs (mDMCs), and single CpGs in Greenland (dark blue) and South African (light orange) sperm. mDMCs were called by merging DMCs separated by a maximum distance of 500 bp. See Excel Tables S3 and S4. (B) Proportion of Greenlandic sperm mDMCs (6,787) that overlap South African sperm mDMCs (overlap=1,187) and South African sperm mDMCs (overlap=12,849) that overlap Greenlandic sperm mDMCs (overlap=1,189). See Excel Tables S3 and S4. (C) Number of DNAme gain or loss mDMCs in Greenlandic or South African sperm. See Excel Tables S3 and S4. (D) Genomic distribution of mDMCs with DNAme gain or loss in Greenlandic or South African sperm relative to the transcriptional start site (TSS). (E–H) Selected significant pathways from GO analysis on genes at an mDMC with DNAme gain or loss in Greenlandic or South African sperm (weighed Fisher p<0.05). Size of circle corresponds to the number of genes from a significant pathway that overlap an mDMC. See Excel Tables S5–S8. (I) Enrichment of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenlandic (white hashes) or South African (white dots) sperm at identified DNAme embryonic persistent regions [see Figure S3, primordial germ cell (PGC) escapees, and transposable element annotations (RepeatMasker hg19 library 20140131)]. Positive enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from the targeted MCC-seq regions. (J) Distribution of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenlandic (white hashes) or South African (white dots) sperm relative to age quarters of LTR-ERV1 TEs (significantly enriched in the four mDMC categories; Figure 2H). Age of LTR-ERV1 TEs was determined by partitioning the TEs’ percentage divergence scores into quarters, where first quarter = low percentage divergence and young LTR-ERV1; second quarter = mid-low percentage divergence and mid-young LTR-ERV1; third quarter = mid-high percentage divergence and mid-old LTR-ERV1; fourth quarter = high percentage divergence and old LTR-ERV1 (see Figure S3B). (K) Enrichment of DNAme gain (dark blue for Greenlandic; dark red for South African) or loss (light blue for Greenlandic; bright yellow for South African) mDMCs in Greenland (white hashes) or South African (white dots) sperm at putative tissue-specific enhancer classes. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from the targeted MCC-seq regions. Note: dyn, dynamic; GO, Gene Ontology; GTPase, guanosine triphosphatase; hESC, human embryonic stem cell; ICM, inner cell mass; MCC-seq, MethylC-Capture-sequencing; NMDA, NMDA, N-methyl-d-aspartate; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; TEs, transposable elements.
Figure 3A is a heatmap, plotting 1,865 cases with peak deH3K4me3 in South African sperm (y-axis) across age, body mass index, p,p′-dichlorodiphenyldichloroethylene group, and p,p′-dichlorodiphenyldichloroethylene (x-axis). The age (years) ranges from 18 to 30; The body mass index (kilograms per meter squared) ranges from 18 to 26; p,p′-dichlorodiphenyldichloroethylene group is divided into two parts, namely, ter 1 and ter 3; and p,p′-dichlorodiphenyldichloroethylene (micrograms per milliliter) ranges from 10 to 50. A scale ranges from negative 4 to 4 in increments of 2. Figures 3B and 3C are bar graphs, plotting proportion of peaks (in percentage), ranging from 0 to 60 in increments of 20 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis) for H3K4me3 gain and H3K4me3 loss. Figures 3D and 3E are each a set one line graph and one heatmap. The line graphs plot H3K4me3 gain, ranging from 2.5 to 15.0 in increments of 2.5 and H3K4me3 loss, ranging from 5.0 to 25.0 in increments of 5.0 (y-axis) across ter 1, ter 2, ter 3 (x-axis). The heatmaps plots 851 cases with peak deH3K4me3 gain and 1,014 cases with peak deH3K4me3 loss (y-axis) across negative 5 kilobytes, center, and 5 kilobytes (x-axis). The scale ranges from 5 to 40 in increments of 5. Figures 3F and 3G are line graphs, plotting density, ranging from 0.00 to 1.00 in increments of 0.25 (y-axis) across log to the base 2 counts per minute of counts at peaks with H3K4me3 gain, ranging from negative 2.5 to 5 in increments of 2.5 and log to the base 2 counts per minute of counts at peaks with H3K4me3 loss, ranging from negative 5 to 5 in increments of 5. Figures 3H and 3J are Integrative Genome Viewer graphs, plotting ter 3, ter 2, and ter 1 (y-axis) across Fibrous sheath-interacting protein 1 and Bromodomain Containing 1 (x-axis). Figures 3J and 3K are scatter dot plots, plotting cell adhesion, dendrite morphogenesis, embryonic skeletal system development, neural crest cell development, response to hormone, cell communication, modulation of chemical synaptic transmission, regulation of dendrite morphogenesis, regulation of Wnt signaling pathway, axon guidance; and neurotransmitter secretion, spermatogenesis, blastocyst development, histone acetylation, regulation of postsynaptic neurotransmitter receptor, dosage compensation by inactivation of X chromosome, positive regulation of histone H3K4 methylation, formation of primary germ layer, regulation of histone acetylation, thyroid gland development, regulation of signal transduction by p53 class mediator, cardiac conduction, hindbrain maturation, insulin secretion involved in cellular response to glucose, substantia nigra development, gene silencing (y-axis) across negative log to the base 2 (weightFisher), ranging from 6 to 10 in unit increments and 6 to 10 in increments of 2 (x-axis) for significant, ranging from 20 to 80 in increments of 20 and 10 to 40 in increments of 10, respectively.
Figure 3.
Histone 3 lysine 4 trimethylation (H3K4me3) enrichment in sperm of South African men with different p,p-DDE serum levels. (A) Heatmap of normalized H3K4me3 counts at the 1,865 peaks with differentially enriched H3K4me3 (deH3K4me3; 851 peaks with H3K4me3 gain and 1,014 peaks with H3K4me3 loss; FDR <0.2) in sperm from South African men with low (ter1) or high (ter3) serum p,p-DDE levels. Serum p,p-DDE concentration, tertile, body mass index (BMI), and age of participants is indicated by colored boxes above the heatmap. (B) Genomic distribution of peaks with H3K4me3 gain in South African sperm relative to the TSS. (C) Genomic distribution of peaks with H3K4me3 loss in South African sperm relative to the TSS. (D–E) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 gain in South African sperm (851 peaks) relative to the three p,p-DDE tertile levels. (E) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 loss in South African sperm (1,014 peaks) relative to the three p,p-DDE tertile levels. (F) Estimator of the cumulative distribution function (ECDF) plot for log2 cpm of H3K4me3 counts at regions with H3K4me3 gain in ter1 (light yellow), ter2 (medium orange), and ter3 (dark red) sperm samples. Dose–response trends were assessed by via Kolmogorov–Smirnov tests with p<0.0001 for ter1 vs. ter2 at H3K4me3 gain regions, p<0.05. (G) ECDF plot for log2 cpm of H3K4me3 counts at regions with H3K4me3 gain in ter1 (dark green), ter2 (medium green), and ter3 (light green) sperm samples. Dose–response trends were assessed by via Kolmogorov–Smirnov tests with p<0.0001 for ter1 vs. ter2 at H3K4me3 gain regions, p<0.0001. (H) Representative Integrative Genome Viewer (IGV) tracks of peak with H3K4me3 gain in South African sperm at the FSIP1 genic region. (I) Representative IGV tracks of peak with H3K4me3 loss in South African sperm at the BRD1 promoter. Data for (A–I) are reported in Excel Table S15. Selected significant pathways for (J–K) are from GO analysis on genes with H3K4me3 gain (J) or loss (K) in South African sperm (weighed Fisher p<0.05). Size of circle corresponds to the number of genes from a significant pathway that overlap a peak with H3K4me3 gain. Shade intensity of the circle indicates log2 (weightFisher) value of significant pathway. See Excel Tables S16 and S17. Note: cpm, counts per minute; GO, Gene Ontology; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; ter, tertile; TSS, transcriptional start site.
Figure 3A is a heatmap, plotting 1,865 cases with peak deH3K4me3 in South African sperm (y-axis) across age, body mass index, p,p′-dichlorodiphenyldichloroethylene group, and p,p′-dichlorodiphenyldichloroethylene (x-axis). The age (years) ranges from 18 to 30; The body mass index (kilograms per meter squared) ranges from 18 to 26; p,p′-dichlorodiphenyldichloroethylene group is divided into two parts, namely, ter 1 and ter 3; and p,p′-dichlorodiphenyldichloroethylene (micrograms per milliliter) ranges from 10 to 50. A scale ranges from negative 4 to 4 in increments of 2. Figures 3B and 3C are bar graphs, plotting proportion of peaks (in percentage), ranging from 0 to 60 in increments of 20 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis) for H3K4me3 gain and H3K4me3 loss. Figures 3D and 3E are each a set one line graph and one heatmap. The line graphs plot H3K4me3 gain, ranging from 2.5 to 15.0 in increments of 2.5 and H3K4me3 loss, ranging from 5.0 to 25.0 in increments of 5.0 (y-axis) across ter 1, ter 2, ter 3 (x-axis). The heatmaps plots 851 cases with peak deH3K4me3 gain and 1,014 cases with peak deH3K4me3 loss (y-axis) across negative 5 kilobytes, center, and 5 kilobytes (x-axis). The scale ranges from 5 to 40 in increments of 5. Figures 3F and 3G are line graphs, plotting density, ranging from 0.00 to 1.00 in increments of 0.25 (y-axis) across log to the base 2 counts per minute of counts at peaks with H3K4me3 gain, ranging from negative 2.5 to 5 in increments of 2.5 and log to the base 2 counts per minute of counts at peaks with H3K4me3 loss, ranging from negative 5 to 5 in increments of 5. Figures 3H and 3J are Integrative Genome Viewer graphs, plotting ter 3, ter 2, and ter 1 (y-axis) across Fibrous sheath-interacting protein 1 and Bromodomain Containing 1 (x-axis). Figures 3J and 3K are scatter dot plots, plotting cell adhesion, dendrite morphogenesis, embryonic skeletal system development, neural crest cell development, response to hormone, cell communication, modulation of chemical synaptic transmission, regulation of dendrite morphogenesis, regulation of Wnt signaling pathway, axon guidance; and neurotransmitter secretion, spermatogenesis, blastocyst development, histone acetylation, regulation of postsynaptic neurotransmitter receptor, dosage compensation by inactivation of X chromosome, positive regulation of histone H3K4 methylation, formation of primary germ layer, regulation of histone acetylation, thyroid gland development, regulation of signal transduction by p53 class mediator, cardiac conduction, hindbrain maturation, insulin secretion involved in cellular response to glucose, substantia nigra development, gene silencing (y-axis) across negative log to the base 2 (weightFisher), ranging from 6 to 10 in unit increments and 6 to 10 in increments of 2 (x-axis) for significant, ranging from 20 to 80 in increments of 20 and 10 to 40 in increments of 10, respectively.
Figure 3.
Histone 3 lysine 4 trimethylation (H3K4me3) enrichment in sperm of South African men with different p,p-DDE serum levels. (A) Heatmap of normalized H3K4me3 counts at the 1,865 peaks with differentially enriched H3K4me3 (deH3K4me3; 851 peaks with H3K4me3 gain and 1,014 peaks with H3K4me3 loss; FDR <0.2) in sperm from South African men with low (ter1) or high (ter3) serum p,p-DDE levels. Serum p,p-DDE concentration, tertile, body mass index (BMI), and age of participants is indicated by colored boxes above the heatmap. (B) Genomic distribution of peaks with H3K4me3 gain in South African sperm relative to the TSS. (C) Genomic distribution of peaks with H3K4me3 loss in South African sperm relative to the TSS. (D–E) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 gain in South African sperm (851 peaks) relative to the three p,p-DDE tertile levels. (E) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 loss in South African sperm (1,014 peaks) relative to the three p,p-DDE tertile levels. (F) Estimator of the cumulative distribution function (ECDF) plot for log2 cpm of H3K4me3 counts at regions with H3K4me3 gain in ter1 (light yellow), ter2 (medium orange), and ter3 (dark red) sperm samples. Dose–response trends were assessed by via Kolmogorov–Smirnov tests with p<0.0001 for ter1 vs. ter2 at H3K4me3 gain regions, p<0.05. (G) ECDF plot for log2 cpm of H3K4me3 counts at regions with H3K4me3 gain in ter1 (dark green), ter2 (medium green), and ter3 (light green) sperm samples. Dose–response trends were assessed by via Kolmogorov–Smirnov tests with p<0.0001 for ter1 vs. ter2 at H3K4me3 gain regions, p<0.0001. (H) Representative Integrative Genome Viewer (IGV) tracks of peak with H3K4me3 gain in South African sperm at the FSIP1 genic region. (I) Representative IGV tracks of peak with H3K4me3 loss in South African sperm at the BRD1 promoter. Data for (A–I) are reported in Excel Table S15. Selected significant pathways for (J–K) are from GO analysis on genes with H3K4me3 gain (J) or loss (K) in South African sperm (weighed Fisher p<0.05). Size of circle corresponds to the number of genes from a significant pathway that overlap a peak with H3K4me3 gain. Shade intensity of the circle indicates log2 (weightFisher) value of significant pathway. See Excel Tables S16 and S17. Note: cpm, counts per minute; GO, Gene Ontology; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; ter, tertile; TSS, transcriptional start site.
Figure 4A is a bar graph titled Peaks with H3K4me3 gain, plotting L T R-E R V 1, L T R E R V L-Ma L R, DNA repeats, and L I N E-1 (y-axis) across lowercase z score, ranging from negative 2.5 to 7.5 in increments of 2.5 (x-axis). Figure 4B is a bar graph titled Peaks with H3K4me3 loss, plotting L T R, L T R E R V L-Ma L R, L I N E-1, S I N E F L A M-A, metastable epialleles, S I N E F R A M, L I N E-2, S I N E M I R, S I N E Alu, S I N E F L A M-C, simple repeat, low complexity (y-axis) across lowercase z score, ranging from 0 to 10 (x-axis). Figures 4C and 4D are heatmaps, plotting L T R E R V L-Ma L R and L I N E-1; S I N E-Alu, S I N E-M I R, and L I N E-2 (y-axis) across young, mid-young, mid-old, old (x-axis). A scale depicts lowercase z score ranging from negative 1 to 1 in increments of 0.5. Figures 4E and 4F are bar graphs, plotting Proportion of peaks (in percentage), ranging from 0 to 40 in increments of 10 and 0 to 60 increments of 10 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis). Figure 4G is a set of three scatter dot plots titled L I N E 2, S I N E-M I R, S I N E-Alu under Overrepresented young TEs at peaks with H3K4me3 loss, plotting chaperone-mediated protein folding, intracellular protein transport, autophagy, regulation of telomerase activity, RNA modification, lipid metabolic process, substantia nigra development, and DNA replication-dependent nucleosome assembly; miRNA mediated inhibition of translation, response to virus, establishment of planar polarity, RNA processing, regulation of NMDA receptor activity, regulation of apoptotic process, cardiac muscle cell differentiation, 7-methylguanosine mRNA capping; and positive regulation of H3K4 methylation, gene silencing, retina development in camera-type eye, face development, regulation of histone acetylation, cardiac muscle membrane repolarization, embryonic skeletal system morphogenesis, positive regulation of viral transcription (y-axis) across negative log to the base 2 (weightFisher), ranging from 5 to 10 in increments of 2.5 (x-axis) for significant, ranging from 10 to 30 in increments of 10, 10 to 30 in increments of 10, and 5 to 25 in increments of 5, respectively. A scale ranging negative log to the base 2 (weightFisher) ranges from 5 to 8 in unit increments, 5.5 to 7.5 in increments of 1.5, and 5 to 6 in unit increments. Figure 4H is a set of two horizontal bar graphs titled Putative enhancers, plotting hESC A, fetal stomach, fetal thymus, hESC B, fetal placenta, trophoblast, osteoblast, fetal small intestine, hESC neuron, fetal leg muscle, fetal heart, fetal brain, fetal spinal cord, hESC NPC, fetal lung, mesendoderm, hESC super-enhancers, sperm; and sperm, mesendoderm, fetal lung, hESC NPC, fetal spinal cord, trophoblast, fetal placenta, osteoblast, fetal small intestine, fetal leg muscle, hESC B, fetal brain, fetal stomach, fetal heart, fetal thymus, hESC neuron, hESC A (y-axis) across lowercase z score, negative 10 to 10 in increments of 5 and negative 10 to 20 in increments of 10 (x-axis) for Peaks with H3K4me3 gain and Peaks with H3K4me3 loss. Figures 4I and 4J are each a set of one line graph and heatmaps. The line graphs plot H3K4me3 gain, ranging from 0 to 25 in increments of 5 and H3K4me3 loss, ranging from 10 to 50 in increments of 10 (y-axis) across sperm, 4-cell, 8-cell, and I C M (x-axis). The heatmaps plots 851 cases with peak deH3K4me3 gain and 1,014 cases with peak deH3K4me3 loss (y-axis) across negative 5 kilobytes, center, and positive 5 kilobytes (x-axis). The scale ranges from 0 to 80 in increments of 20, 0 to 30 in increments of 15, 0 to 120 in increments of 40, and 0 to 120 in increments of 40. Figures 4K to 4M are scatter plots titled 4-cell, 8 cell, and ICM under Promoters with H3K4me3 loss and expressed genes in embryos (R P K M greater than 1), plotting log to the base 2 of (sperm H3K4me3 promoter counts plus 1), ranging from 10 to 18 in increments of 2 (y-axis) across log to the base 2 of (4-cell H3K4me3 promoter counts plus 1), ranging from 5 to 10 in increments of 2.5 (x-axis), respectively. A scale depicts the log to the base 2 of (4-cell R P K M plus 1); log to the base 2 of (8-cell R P K M plus 1); and log to the base 2 of (I C M R P K M plus 1), each ranging from 0 to 6.
Figure 4.
Overlap between peaks with H3K4me3 loss in South African sperm, TEs, and regions that retain H3K4me3 in the preimplantation embryo. (A) Enrichment for peaks with H3K4me3 gain in South African sperm at transposable element annotations (RepeatMasker hg19 library 20140131). Positive and negative enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. (B) Enrichment for peaks with H3K4me3 loss in South African sperm at transposable element annotations (RepeatMasker hg19 library 20140131). Positive and negative enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. (C) Distribution for peaks with H3K4me3 gain in South African sperm relative to age quarters of LINE-1 and LTR ERV-MaLR transposable element classes [significantly enriched for peaks with H3K4me3 gain (A)]. Age of TEs was determined by partitioning the class percentage divergence score in quarters (see Figure S4G,H). (D) Distribution for peaks with H3K4me3 loss in South African sperm relative to age quarters of LINE-2, SINE-MIR, and SINE-Alu transposable element classes [significantly enriched for peaks with H3K4me3 loss (B)]. Age of TEs was determined by partitioning the class percentage divergence score in quarters (see Figure S4I–K). (E) Genomic distribution of peaks with H3K4me3 gain in South African sperm that overlap an old (4th quarter) LINE-1 (139) or LTR ERVL-MaLR (74) transposable element, relative to the TSS. (F) Genomic distribution of peaks with H3K4me3 loss in South African sperm that overlap a young (1st quarter) LINE-2 (187), SINE-MIR (267), or SINE-Alu (507) transposable element, relative to the TSS. (G) Selected significant pathways from GO analysis on promoter peaks with H3K4me3 loss in sperm that overlap a young (1st quarter) LINE-2 (324 promoters), SINE-MIR (442 promoters), or SINE-Alu (809 promoters) TEs at (weighed Fisher p<0.05). Size of dots corresponds to the number of genes from a significant pathway that overlap a peak with H3K4me3 loss. Color of the dots indicates log2 (weightFisher) value of significant pathway. See Excel Tables S18–S20. (H) Enrichment for peaks with H3K4me3 gain (yellow) or loss (green) in South African sperm at tissue-specific putative enhancer annotations. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. Data are reported in Excel Table S19. (I) Sperm and preimplantation embryo (4-cell, 8-cell, ICM) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 gain in South African sperm (851 peaks). Data are reported in Excel Table S20. (J) Sperm and preimplantation embryo (4-cell, 8-cell, ICM) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 loss in South African sperm (1,014 peaks). (K–M) Scatter plots where the x-axis corresponds to the log2 (preimplantation embryo H3K4me3 promoter counts +1) and the y-axis corresponds to the log2 (sperm H3K4me3 promoter counts +1) at promoters with H3K4me3 loss in South African sperm that are expressed at the described stages of preimplantation embryo development (RPKM >1). Color of the scatter points correspond to the log2 preimplantation embryo RPKM gene expression +1. Dashed lines correspond to H3K4me3 promoter density cutoffs for preimplantation embryo (x-axis) or sperm (y-axis). Gray box denotes promoters with H3K4me3 in sperm and 4-cell (K), 8-cell (L), and ICM (M) preimplantation embryos. Note: GO, Gene Ontology; H3K4me3, histone H3 lysine 4 trimethylation; hESC, human embryonic stem cell; ICM, inner cell mass; NMDA, N-methyl-d-aspartate; NPC, neural progenitor cell; RPKM, reads per kilobase per million mapped reads; TEs, transposable elements; TSS, transcriptional start site.
Figure 4A is a bar graph titled Peaks with H3K4me3 gain, plotting L T R-E R V 1, L T R E R V L-Ma L R, DNA repeats, and L I N E-1 (y-axis) across lowercase z score, ranging from negative 2.5 to 7.5 in increments of 2.5 (x-axis). Figure 4B is a bar graph titled Peaks with H3K4me3 loss, plotting L T R, L T R E R V L-Ma L R, L I N E-1, S I N E F L A M-A, metastable epialleles, S I N E F R A M, L I N E-2, S I N E M I R, S I N E Alu, S I N E F L A M-C, simple repeat, low complexity (y-axis) across lowercase z score, ranging from 0 to 10 (x-axis). Figures 4C and 4D are heatmaps, plotting L T R E R V L-Ma L R and L I N E-1; S I N E-Alu, S I N E-M I R, and L I N E-2 (y-axis) across young, mid-young, mid-old, old (x-axis). A scale depicts lowercase z score ranging from negative 1 to 1 in increments of 0.5. Figures 4E and 4F are bar graphs, plotting Proportion of peaks (in percentage), ranging from 0 to 40 in increments of 10 and 0 to 60 increments of 10 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis). Figure 4G is a set of three scatter dot plots titled L I N E 2, S I N E-M I R, S I N E-Alu under Overrepresented young TEs at peaks with H3K4me3 loss, plotting chaperone-mediated protein folding, intracellular protein transport, autophagy, regulation of telomerase activity, RNA modification, lipid metabolic process, substantia nigra development, and DNA replication-dependent nucleosome assembly; miRNA mediated inhibition of translation, response to virus, establishment of planar polarity, RNA processing, regulation of NMDA receptor activity, regulation of apoptotic process, cardiac muscle cell differentiation, 7-methylguanosine mRNA capping; and positive regulation of H3K4 methylation, gene silencing, retina development in camera-type eye, face development, regulation of histone acetylation, cardiac muscle membrane repolarization, embryonic skeletal system morphogenesis, positive regulation of viral transcription (y-axis) across negative log to the base 2 (weightFisher), ranging from 5 to 10 in increments of 2.5 (x-axis) for significant, ranging from 10 to 30 in increments of 10, 10 to 30 in increments of 10, and 5 to 25 in increments of 5, respectively. A scale ranging negative log to the base 2 (weightFisher) ranges from 5 to 8 in unit increments, 5.5 to 7.5 in increments of 1.5, and 5 to 6 in unit increments. Figure 4H is a set of two horizontal bar graphs titled Putative enhancers, plotting hESC A, fetal stomach, fetal thymus, hESC B, fetal placenta, trophoblast, osteoblast, fetal small intestine, hESC neuron, fetal leg muscle, fetal heart, fetal brain, fetal spinal cord, hESC NPC, fetal lung, mesendoderm, hESC super-enhancers, sperm; and sperm, mesendoderm, fetal lung, hESC NPC, fetal spinal cord, trophoblast, fetal placenta, osteoblast, fetal small intestine, fetal leg muscle, hESC B, fetal brain, fetal stomach, fetal heart, fetal thymus, hESC neuron, hESC A (y-axis) across lowercase z score, negative 10 to 10 in increments of 5 and negative 10 to 20 in increments of 10 (x-axis) for Peaks with H3K4me3 gain and Peaks with H3K4me3 loss. Figures 4I and 4J are each a set of one line graph and heatmaps. The line graphs plot H3K4me3 gain, ranging from 0 to 25 in increments of 5 and H3K4me3 loss, ranging from 10 to 50 in increments of 10 (y-axis) across sperm, 4-cell, 8-cell, and I C M (x-axis). The heatmaps plots 851 cases with peak deH3K4me3 gain and 1,014 cases with peak deH3K4me3 loss (y-axis) across negative 5 kilobytes, center, and positive 5 kilobytes (x-axis). The scale ranges from 0 to 80 in increments of 20, 0 to 30 in increments of 15, 0 to 120 in increments of 40, and 0 to 120 in increments of 40. Figures 4K to 4M are scatter plots titled 4-cell, 8 cell, and ICM under Promoters with H3K4me3 loss and expressed genes in embryos (R P K M greater than 1), plotting log to the base 2 of (sperm H3K4me3 promoter counts plus 1), ranging from 10 to 18 in increments of 2 (y-axis) across log to the base 2 of (4-cell H3K4me3 promoter counts plus 1), ranging from 5 to 10 in increments of 2.5 (x-axis), respectively. A scale depicts the log to the base 2 of (4-cell R P K M plus 1); log to the base 2 of (8-cell R P K M plus 1); and log to the base 2 of (I C M R P K M plus 1), each ranging from 0 to 6.
Figure 4.
Overlap between peaks with H3K4me3 loss in South African sperm, TEs, and regions that retain H3K4me3 in the preimplantation embryo. (A) Enrichment for peaks with H3K4me3 gain in South African sperm at transposable element annotations (RepeatMasker hg19 library 20140131). Positive and negative enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. (B) Enrichment for peaks with H3K4me3 loss in South African sperm at transposable element annotations (RepeatMasker hg19 library 20140131). Positive and negative enrichments were determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. (C) Distribution for peaks with H3K4me3 gain in South African sperm relative to age quarters of LINE-1 and LTR ERV-MaLR transposable element classes [significantly enriched for peaks with H3K4me3 gain (A)]. Age of TEs was determined by partitioning the class percentage divergence score in quarters (see Figure S4G,H). (D) Distribution for peaks with H3K4me3 loss in South African sperm relative to age quarters of LINE-2, SINE-MIR, and SINE-Alu transposable element classes [significantly enriched for peaks with H3K4me3 loss (B)]. Age of TEs was determined by partitioning the class percentage divergence score in quarters (see Figure S4I–K). (E) Genomic distribution of peaks with H3K4me3 gain in South African sperm that overlap an old (4th quarter) LINE-1 (139) or LTR ERVL-MaLR (74) transposable element, relative to the TSS. (F) Genomic distribution of peaks with H3K4me3 loss in South African sperm that overlap a young (1st quarter) LINE-2 (187), SINE-MIR (267), or SINE-Alu (507) transposable element, relative to the TSS. (G) Selected significant pathways from GO analysis on promoter peaks with H3K4me3 loss in sperm that overlap a young (1st quarter) LINE-2 (324 promoters), SINE-MIR (442 promoters), or SINE-Alu (809 promoters) TEs at (weighed Fisher p<0.05). Size of dots corresponds to the number of genes from a significant pathway that overlap a peak with H3K4me3 loss. Color of the dots indicates log2 (weightFisher) value of significant pathway. See Excel Tables S18–S20. (H) Enrichment for peaks with H3K4me3 gain (yellow) or loss (green) in South African sperm at tissue-specific putative enhancer annotations. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions (of the same size) resampled from sperm H3K4me3 peaks. Data are reported in Excel Table S19. (I) Sperm and preimplantation embryo (4-cell, 8-cell, ICM) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 gain in South African sperm (851 peaks). Data are reported in Excel Table S20. (J) Sperm and preimplantation embryo (4-cell, 8-cell, ICM) H3K4me3 signal intensity heatmaps at ±5 kb the center of the peaks with H3K4me3 loss in South African sperm (1,014 peaks). (K–M) Scatter plots where the x-axis corresponds to the log2 (preimplantation embryo H3K4me3 promoter counts +1) and the y-axis corresponds to the log2 (sperm H3K4me3 promoter counts +1) at promoters with H3K4me3 loss in South African sperm that are expressed at the described stages of preimplantation embryo development (RPKM >1). Color of the scatter points correspond to the log2 preimplantation embryo RPKM gene expression +1. Dashed lines correspond to H3K4me3 promoter density cutoffs for preimplantation embryo (x-axis) or sperm (y-axis). Gray box denotes promoters with H3K4me3 in sperm and 4-cell (K), 8-cell (L), and ICM (M) preimplantation embryos. Note: GO, Gene Ontology; H3K4me3, histone H3 lysine 4 trimethylation; hESC, human embryonic stem cell; ICM, inner cell mass; NMDA, N-methyl-d-aspartate; NPC, neural progenitor cell; RPKM, reads per kilobase per million mapped reads; TEs, transposable elements; TSS, transcriptional start site.
Figure 5A is set of six donut charts titled DNA methylation and H3K4me3 overlaps. From left to right, the first donut chart displays the following information: 830,188 cases of M C C-sequence target regions and 157,753 cases of overlap to H3K4me3 peaks. The second donut chart displays the following information: 48,499 cases of H3K4me3 peaks and 34,741 cases of overlap to M C C-sequence target regions. The third donut chart displays the following information: 12,849 cases of South Africa merged differentially methylated CpGs and 1,744 cases of overlap to H3K4me3 peaks. The fourth donut chart displays the following information: 48,499 cases of H3K4me3 peaks and 1,618 cases of overlap to South Africa merged differentially methylated CpGs. The fifth donut chart displays the following information: 12,849 cases of South Africa merged differentially methylated CpGs and 106 cases of overlap of South Africa deH3K4me3 peaks. The sixth donut chart displays the following information: 1,855 cases of South Africa deH3K4me3 peaks and 99 cases of overlap to South Africa merged differentially methylated CpGs. Figure 5B is a scatter plot, plotting South Africa population ter 3 to ter 1, log to the base 2-fold change (H3K4me3 counts at overlap plus 1), ranging from negative 0.2 to 0.2 in increments of 0.1 (y-axis) across effect size–South Africa population, ranging from negative 1 to 1 in increments of (x-axis) for H3K4me3 gain, DNA methylation loss and H3K4me3 loss, DNA methylation gain. Figure 5C is a bar graph, plotting proportions of overlap (in percentage), ranging from 0 to 60 in increments of 20 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis) for H3K4me3 loss peak overlapped to DNA methylation gain merged differentially methylated CpGs and DNA methylation gain merged differentially methylated CpGs overlapped to H3K4me3 loss peak. Figures 5D and 5E are horizontal bar graphs, plotting CpG islands, CpG shelves, 3’UTRs, promoters, 5′UTRs, exons, first exons, CpG shores, 1 to 5 kilobytes; and intermediate DNA methylation sperm-to-zygote persistent regions, low DNA methylation sperm-to-ICM persistent regions, low DNA methylation sperm-to-zygote persistent regions, SINE MIRc, hESC enhancers, H3K4me3 sperm-to-ICM persistent regions, H3K4me3 sperm-to-4-cell persistent regions (y-axis) across lowercase z score, ranging from 0 to 10 in increments of 5 and 0 to 20 in increments of 10 (x-axis) for H3K4me3 loss peak overlapped to DNA methylation gain merged differentially methylated CpGs and DNA methylation gain merged differentially methylated CpGs overlapped to H3K4me3 loss peak. Figure 5F is a scatter dot plot, plotting nervous system development, immune response, embryonic organ development, transcription by RNA polymerase 2, RNA processing, cilium movement, cellular process involved in reproduction, cellular macromolecule catabolic process, neurogenesis, regulation of cilium assembly, spermatid development, regulation of nucleic acid-templated transcription, double-strand break repair, neutrophil mediated immunity, meiosis 1, and posttranslational protein modification (y-axis) across negative log to the base 2 (weightFisher), ranging from 1 to 5 in unit increments (x-axis) for significant, ranging from 2 to 6 in increments of 2. The scale depicts the negative log to the base 2 (weightFisher), ranging from 2 and 4 in unit increments. Figure 5G is a set of four Representative Integrative Genome Viewer tracks, plotting Int CpGs, including MCC-seq and H3K4me3 ChIP-seq, each for ter 3, ter 2, and ter 1; MCC-seq and H3K4me3 ChIP-seq, each ranging as ter 3, ter 2, and ter 1; MCC-seq, ranging as ter 3, ter 2, and ter 1; and MCC-seq and H3K4me3 ChIP-seq, ranging as ter 3, ter 2, and ter 1 (y-axis) across L T R 43 (L T R-E R V 1) and M E R 5 A (DNA-hAT-Charlie); L 2 b (L I N E-2) per M E R 81 (DNA-hAT-Blackjack) under T R A P P C 9; L T R 43 (L T R-E R V 1); and T R A P P C 9 (x-axis).
Figure 5.
Intersection between differentially enriched H3K4me3 (deH3K4me3) peaks and merged differentially methylated CpGs (mDMCs) in sperm of South African men with different p,p-DDE serum levels. (A) Proportion of MCC-seq target regions that overlap a H3K4me3 peak in sperm (overlap=157,753; dark gray donut plot), proportion of H3K4me3 peaks in sperm that overlap a MCC-seq target region (overlap=34,741; light gray donut plot), proportion of South Africa mDMCs that overlap a H3K4me3 peak in sperm (overlap=1,744; dark green donut plot), proportion of H3K4me3 peaks in sperm that intersect a South Africa mDMC (overlap=1,618; light green), proportion of South Africa mDMCs that overlap a deH3K4me3 peak in sperm (overlap=106; dark blue), proportion of South Africa deH3K4me3 peaks in sperm that intersect an mDMC in sperm (overlap=99; light blue). (B) Scatter plot corresponding to South African population ter3 to ter1 log2-fold change (H3K4me3 counts at overlapping mDMCs +1) relative to South Africa beta value at the mDMCs. Gray dots correspond to mDMCs that do not overlap with a deH3K4me3 peak. mDMCs intersecting a deH3K4me3 peak are denoted by color. H3K4me3 loss + DNAme gain overlap was the predominant overlap (n=88 of 106). (C) Distribution of peaks with H3K4me3 loss overlapped to DNAme gain mDMC (light blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (dark green) relative to the TSS. (D) Enrichment for peaks with H3K4me3 loss overlapped to DNAme gain mDMC (light blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (dark green) at genic and CpG annotations. Positive enrichments are determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions resampled from deH3K4me3 peaks (light blue) and mDMCs (dark green), respectively. (E) Enrichment for peaks with H3K4me3 loss overlapped to DNAme gain mDMC (blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (green) at TE annotations and characterized DNAme/H3K4me3 persistent regions (see Figure S2 and S5). Positive enrichments are determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions resampled from deH3K4me3 peaks (light blue) and mDMCs (dark green), respectively. (F) Selected significant pathways from GO analysis on peak with H3K4me3 loss that overlaps a DNAme gain mDMC at promoters and 1–5 kb genic space. Size of dots corresponds to the number of significant genes in the pathway. Color and position of dots correspond to log2 (weightFisher). See Excel Table S27. (G) Representative IGV tracks of peak with H3K4me3 loss and DNAme gain in South African sperm at an LTR-ERV1 and in the TRAPPC9 genic space. Purple shaded box corresponds to the mDMC. Tracks below are a zoom at the mDMC. Intermediate CpGs are indicated in purple. Note: ChiP-Seq, chromatin immunoprecipitation targeting histone H3K4me3 followed by sequencing; DNAme, DNA methylation; GO, Gene Ontology; H3K4me3, histone H3 lysine 4 trimethylation; hESC, human embryonic stem cell; ICM, inner cell mass; IGV, Integrative Genome Viewer; int, intermediate; MCC-seq, MethylC-Capture-sequencing; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; TE, transposable element; ter, tertile; TSS, transcriptional start site; UTR, untranslated region.
Figure 5A is set of six donut charts titled DNA methylation and H3K4me3 overlaps. From left to right, the first donut chart displays the following information: 830,188 cases of M C C-sequence target regions and 157,753 cases of overlap to H3K4me3 peaks. The second donut chart displays the following information: 48,499 cases of H3K4me3 peaks and 34,741 cases of overlap to M C C-sequence target regions. The third donut chart displays the following information: 12,849 cases of South Africa merged differentially methylated CpGs and 1,744 cases of overlap to H3K4me3 peaks. The fourth donut chart displays the following information: 48,499 cases of H3K4me3 peaks and 1,618 cases of overlap to South Africa merged differentially methylated CpGs. The fifth donut chart displays the following information: 12,849 cases of South Africa merged differentially methylated CpGs and 106 cases of overlap of South Africa deH3K4me3 peaks. The sixth donut chart displays the following information: 1,855 cases of South Africa deH3K4me3 peaks and 99 cases of overlap to South Africa merged differentially methylated CpGs. Figure 5B is a scatter plot, plotting South Africa population ter 3 to ter 1, log to the base 2-fold change (H3K4me3 counts at overlap plus 1), ranging from negative 0.2 to 0.2 in increments of 0.1 (y-axis) across effect size–South Africa population, ranging from negative 1 to 1 in increments of (x-axis) for H3K4me3 gain, DNA methylation loss and H3K4me3 loss, DNA methylation gain. Figure 5C is a bar graph, plotting proportions of overlap (in percentage), ranging from 0 to 60 in increments of 20 (y-axis) across distance to transcriptional start site (kilobytes), ranging as less than 0.1, 0.1 to 1, 1 to 5, 5 to 10, 10 to 50, 50 to 100, and greater than 100 (x-axis) for H3K4me3 loss peak overlapped to DNA methylation gain merged differentially methylated CpGs and DNA methylation gain merged differentially methylated CpGs overlapped to H3K4me3 loss peak. Figures 5D and 5E are horizontal bar graphs, plotting CpG islands, CpG shelves, 3’UTRs, promoters, 5′UTRs, exons, first exons, CpG shores, 1 to 5 kilobytes; and intermediate DNA methylation sperm-to-zygote persistent regions, low DNA methylation sperm-to-ICM persistent regions, low DNA methylation sperm-to-zygote persistent regions, SINE MIRc, hESC enhancers, H3K4me3 sperm-to-ICM persistent regions, H3K4me3 sperm-to-4-cell persistent regions (y-axis) across lowercase z score, ranging from 0 to 10 in increments of 5 and 0 to 20 in increments of 10 (x-axis) for H3K4me3 loss peak overlapped to DNA methylation gain merged differentially methylated CpGs and DNA methylation gain merged differentially methylated CpGs overlapped to H3K4me3 loss peak. Figure 5F is a scatter dot plot, plotting nervous system development, immune response, embryonic organ development, transcription by RNA polymerase 2, RNA processing, cilium movement, cellular process involved in reproduction, cellular macromolecule catabolic process, neurogenesis, regulation of cilium assembly, spermatid development, regulation of nucleic acid-templated transcription, double-strand break repair, neutrophil mediated immunity, meiosis 1, and posttranslational protein modification (y-axis) across negative log to the base 2 (weightFisher), ranging from 1 to 5 in unit increments (x-axis) for significant, ranging from 2 to 6 in increments of 2. The scale depicts the negative log to the base 2 (weightFisher), ranging from 2 and 4 in unit increments. Figure 5G is a set of four Representative Integrative Genome Viewer tracks, plotting Int CpGs, including MCC-seq and H3K4me3 ChIP-seq, each for ter 3, ter 2, and ter 1; MCC-seq and H3K4me3 ChIP-seq, each ranging as ter 3, ter 2, and ter 1; MCC-seq, ranging as ter 3, ter 2, and ter 1; and MCC-seq and H3K4me3 ChIP-seq, ranging as ter 3, ter 2, and ter 1 (y-axis) across L T R 43 (L T R-E R V 1) and M E R 5 A (DNA-hAT-Charlie); L 2 b (L I N E-2) per M E R 81 (DNA-hAT-Blackjack) under T R A P P C 9; L T R 43 (L T R-E R V 1); and T R A P P C 9 (x-axis).
Figure 5.
Intersection between differentially enriched H3K4me3 (deH3K4me3) peaks and merged differentially methylated CpGs (mDMCs) in sperm of South African men with different p,p-DDE serum levels. (A) Proportion of MCC-seq target regions that overlap a H3K4me3 peak in sperm (overlap=157,753; dark gray donut plot), proportion of H3K4me3 peaks in sperm that overlap a MCC-seq target region (overlap=34,741; light gray donut plot), proportion of South Africa mDMCs that overlap a H3K4me3 peak in sperm (overlap=1,744; dark green donut plot), proportion of H3K4me3 peaks in sperm that intersect a South Africa mDMC (overlap=1,618; light green), proportion of South Africa mDMCs that overlap a deH3K4me3 peak in sperm (overlap=106; dark blue), proportion of South Africa deH3K4me3 peaks in sperm that intersect an mDMC in sperm (overlap=99; light blue). (B) Scatter plot corresponding to South African population ter3 to ter1 log2-fold change (H3K4me3 counts at overlapping mDMCs +1) relative to South Africa beta value at the mDMCs. Gray dots correspond to mDMCs that do not overlap with a deH3K4me3 peak. mDMCs intersecting a deH3K4me3 peak are denoted by color. H3K4me3 loss + DNAme gain overlap was the predominant overlap (n=88 of 106). (C) Distribution of peaks with H3K4me3 loss overlapped to DNAme gain mDMC (light blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (dark green) relative to the TSS. (D) Enrichment for peaks with H3K4me3 loss overlapped to DNAme gain mDMC (light blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (dark green) at genic and CpG annotations. Positive enrichments are determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions resampled from deH3K4me3 peaks (light blue) and mDMCs (dark green), respectively. (E) Enrichment for peaks with H3K4me3 loss overlapped to DNAme gain mDMC (blue) and DNAme gain mDMC overlapped to H3K4me3 loss region (green) at TE annotations and characterized DNAme/H3K4me3 persistent regions (see Figure S2 and S5). Positive enrichments are determined by z-scores using the Bioconductor package regioneR. For all annotations displayed, p<0.0001 and n=10,000 permutations of random regions resampled from deH3K4me3 peaks (light blue) and mDMCs (dark green), respectively. (F) Selected significant pathways from GO analysis on peak with H3K4me3 loss that overlaps a DNAme gain mDMC at promoters and 1–5 kb genic space. Size of dots corresponds to the number of significant genes in the pathway. Color and position of dots correspond to log2 (weightFisher). See Excel Table S27. (G) Representative IGV tracks of peak with H3K4me3 loss and DNAme gain in South African sperm at an LTR-ERV1 and in the TRAPPC9 genic space. Purple shaded box corresponds to the mDMC. Tracks below are a zoom at the mDMC. Intermediate CpGs are indicated in purple. Note: ChiP-Seq, chromatin immunoprecipitation targeting histone H3K4me3 followed by sequencing; DNAme, DNA methylation; GO, Gene Ontology; H3K4me3, histone H3 lysine 4 trimethylation; hESC, human embryonic stem cell; ICM, inner cell mass; IGV, Integrative Genome Viewer; int, intermediate; MCC-seq, MethylC-Capture-sequencing; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; TE, transposable element; ter, tertile; TSS, transcriptional start site; UTR, untranslated region.
Figure 6 is a scientific illustration that depicts the following information: On the top-left, there are 47 cases of Greenlandic men (INUENDO cohort), where diet, including p,p′-dichlorodiphenyldichloroethylene indirect exposure, leads to serum p,p′-dichlorodiphenyldichloroethylene mean: 870.734 plus or minus 134.030 nanograms per milliliter. The p,p′-dichlorodiphenyldichloroethylene mean has a bidirectional relationship with normal semen parameters. On the top-right, there are 50 cases of South African men (VhaVenda cohort), where indoor residual spraying, including dichlorodiphenyltrichloroethane and p,p′-dichlorodiphenyldichloroethylene direct exposure, leads to serum p,p′-dichlorodiphenyldichloroethylene mean: 10,462.228 plus or minus 1,792.298 nanograms per milliliter. The p,p′-dichlorodiphenyldichloroethylene mean has a bidirectional relationship with normal semen parameters. At the center-left, under MethylC-Capture-sequencing (M C C-seq; DNA methylation), a set of two graphs represents differently methylated CpGs in sperm (D M Cs). A Venn diagram has two circles. The circle on the left is labeled Greenland differentially methylated CpGs, and the circle on the right is labeled South Africa differentially methylated CpGs. The intersection area is labeled overlap. A line graph titled dose–response-like trend, plotting percentage DNA methylation changes in sperm (y-axis) across serum p,p′-dichlorodiphenyldichloroethylene. A horizontal bar graph plots significant pathways (y-axis) across lowercase p value (x-axis) for brain enhancers, sperm enhancers, neurogenesis genes, and axon guidance genes. At the center-right, under chromatin immunoprecipitation followed by sequencing (ChiP-seq; H3K4me3), there are two Representative Integrative Genome Viewer graphs, plotting high serum p,p′-dichlorodiphenyldichloroethylene, normal serum p,p′-dichlorodiphenyldichloroethylene, and low serum p,p′-dichlorodiphenyldichloroethylene (y-axis) across peaks with H3K4me3 gain in sperm and peaks with H3K4me3 loss in sperm (x-axis). Below, there are two horizontal bar graphs, plotting significant pathways (y-axis) across lowercase p value (x-axis) for sperm enhancers, hormone genes, axon guidance genes, and cellular process genes; and brain enhancers, placenta enhancers, neurogenesis genes, and chromatin genes. At the bottom, an illustration displays the following information: On the left, sperm-to-embryo persistently merged differently methylated CpGs lead to young transposable elements: L T R-E R V 1. The young transposable elements: L T R-E R V 1 has a bidirectional relationship to population child health, including A D D, A D H D, autism, behavioral problems, and lower birth weight. On the right, sperm-to-embryo persistent deH3K4me3 peaks lead to young transposable elements: L I N E-2, S I N E-Mir, and S I N E-Alu. The young transposable elements L I N E-2, S I N E-Mir, and S I N E-Alu have a bidirectional relationship to population child health, including neurodevelopmental disorders, immune impairments, and urogenital malformations.
Figure 6.
Overview and author interpretation of the main findings; a summary of the consequences of p,p-DDE exposure on the sperm epigenome of Greenlandic Inuit and South African VhaVenda men and the health implications for the next generation. We assessed the sperm epigenome (DNAme and H3K4me3) of two geographically diverse populations, Greenlandic Inuit and South African men, in relationship to p,p-DDE serum levels. Dose–response trends between sperm DNAme/H3K4me3 and p,p-DDE serum levels were observed within both populations. Regions with altered epigenetic marks in sperm were related to neurodevelopment and fertility, co-localized to young TEs, and overlapped regions that were predicted to resist epigenetic reprogramming in the preimplantation embryo. Note: ADD, attention deficit disorder; ADHD, attention deficit/hyperactivity disorder; ChIP-seq, chromatin immunoprecipitation targeting histone H3K4me3 followed by sequencing; DDT, dichlorodiphenyltrichloroethane; DMCs, differentially methylated CpGs; DNAme, DNA methylation; H3K4me3, histone H3 lysine 4 trimethylation; p,p-DDE, p,p-dichlorodiphenyldichloroethylene; TEs, transposable elements.

References

    1. Lallas PL. 2001. The Stockholm convention on persistent organic pollutants. Am j Int Law 95(3):692–708, 10.2307/2668517. - DOI
    1. Turusov V, Rakitsky V, Tomatis L. 2002. Dichlorodiphenyltrichloroethane (DDT): ubiquity, persistence, and risks. Environ Health Perspect 110(2):125–128, PMID: , 10.1289/ehp.02110125. - DOI - PMC - PubMed
    1. van den Berg H, Manuweera G, Konradsen F. 2017. Global trends in the production and use of DDT for control of malaria and other vector-borne diseases. Malar J 16(1):401, PMID: , 10.1186/s12936-017-2050-2. - DOI - PMC - PubMed
    1. WHO (World Health Organization). 2006. Indoor Residual Spraying: Use of Indoor Residual Spraying for Scaling up Global Malaria Control and Elimination: WHO Position Statement. Geneva, Switzerland: World Health Organization. https://apps.who.int/iris/bitstream/handle/10665/69386/WHO_HTM_MAL_2006.... [accessed 1 August 2021].
    1. Kelce WR, Stone CR, Laws SC, Gray LE, Kemppainen JA, Wilson EM. 1995. Persistent DDT metabolite p,p′-DDE is a potent androgen receptor antagonist. Nature 375(6532):581–585, PMID: , 10.1038/375581a0. - DOI - PubMed

LinkOut - more resources