Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 May;50(5):699-707.
doi: 10.1038/s41588-018-0102-3. Epub 2018 Apr 16.

Transcription factors operate across disease loci, with EBNA2 implicated in autoimmunity

Affiliations

Transcription factors operate across disease loci, with EBNA2 implicated in autoimmunity

John B Harley et al. Nat Genet. 2018 May.

Abstract

Explaining the genetics of many diseases is challenging because most associations localize to incompletely characterized regulatory regions. Using new computational methods, we show that transcription factors (TFs) occupy multiple loci associated with individual complex genetic disorders. Application to 213 phenotypes and 1,544 TF binding datasets identified 2,264 relationships between hundreds of TFs and 94 phenotypes, including androgen receptor in prostate cancer and GATA3 in breast cancer. Strikingly, nearly half of systemic lupus erythematosus risk loci are occupied by the Epstein-Barr virus EBNA2 protein and many coclustering human TFs, showing gene-environment interaction. Similar EBNA2-anchored associations exist in multiple sclerosis, rheumatoid arthritis, inflammatory bowel disease, type 1 diabetes, juvenile idiopathic arthritis and celiac disease. Instances of allele-dependent DNA binding and downstream effects on gene expression at plausibly causal variants support genetic mechanisms dependent on EBNA2. Our results nominate mechanisms that operate across risk loci within disease phenotypes, suggesting new models for disease origins.

PubMed Disclaimer

Conflict of interest statement

Competing Financial Interests Statement

J.B.H., M.T.W., and L.C.K. have a submitted patent application relating to these findings. A.B. is a co-founder of Datirium, LLC.

Figures

Figure 1
Figure 1. Intersection between autoimmune risk loci and TF binding interactions with the genome
a. Results for SLE risk loci. X-axis displays SLE-associated loci. Y-axis displays the top 25 TFs, based on RELI P-values, sorted by the number of loci. A colored box indicates that the given locus contains at least one SLE-associated variant located within a ChIP-seq peak for the given TF. The most significant ChIP-seq dataset cell type is indicated in parentheses. TFs that participate in “EBNA2 super-enhancers” are colored red. The red rectangle identifies those loci and TFs that optimally cluster together (see Online Methods). Bottom panel, left: comparison of EBV-infected B cell lines (grey bars) to EBV negative B cells (white bars). The Y-axis shows the distribution of the RELI –log (Pcs) for each of the eight TFs with available data. Bars indicate mean. Error bars indicate standard deviation. Numbers indicate number of datasets. Horizontal line indicates the Pc<10−6 RELI significance threshold. Bottom panel, right: The top 10 TFs (based on RELI Pc-values) with data available in at least one EBV-infected B cell line (grey bars) and at least one other cell type (white bars). b–g. Results for the other six EBNA2 disorders. Full results are available in Supplementary Data Set 5.
Figure 2
Figure 2. Properties of EBNA2-bound autoimmune disease loci
a. Schematic of the RELI algorithm. See Online Methods for details. b. TFs intersecting autoimmune risk loci occupied by EBNA2. RELI was re-executed using EBNA2 disorder variants intersecting EBNA2 ChIP-seq peaks as input. Top TFs are indicated. NFκB subunits are shown in red. Basal transcriptional machinery proteins are shown in blue. c. Most EBNA2-occupied loci are associated with only a single EBNA2 disorder. EBNA2-bound loci were categorized by the number of EBNA2 disorders with which the given locus is associated (X-axis). d. Functional properties of EBNA2 disorder EBNA2-occupied loci. Functional importance of EBNA2-occupied loci, assessed with four criteria. In each panel, variants are segregated into two categories – common variants (left bars) and common variants associated with at least one EBNA2 disorder (right bars). Each category is divided into three types of variants (see key). The Y-axis of each plot indicates the percent of variants in each group that are, for example, eQTLs in EBV-infected B cells (top left plot). Error bars indicate the standard deviation obtained from sampling (with replacement) of 50% of the variants. Values below indicate number of variants. Horizontal bars at the top indicate sampling-derived P-values based on Welch’s one-sided t-test.
Figure 3
Figure 3. Allele-dependent binding of EBNA2 to autoimmune-associated genetic variants
a. Theoretical models presenting possible allele-dependent action of EBNA2. See text for discussion. b. Allele-dependent co-binding of EBNA2 with multiple proteins. ChIP-seq datasets from EBV-infected B cell lines were examined for evidence of allele-dependent binding at heterozygotes. Datasets are sorted by the proportion of EBNA2 GM12878 allele-dependent events (MARIO ARS value > 0.40, see Online Methods) that favor the same allele (X-axis). Values (N) indicate total number of variants. c. Allele-dependent binding of EBNA2 and human proteins at the CD44 locus. Top to bottom: chromosomal band (multi-colored bar), location of EBV-infected B cell line ChIP-seq peaks for various TFs, location of rs3794102 variant, allele-dependent binding events (green bars). X-axis indicates the preferred allele, along with a value indicating the strength of the allelic behavior, calculated as one minus the ratio of the weak to strong reads (e.g., 0.5 indicates the strong allele has approximately twice the reads of the weak allele). d. Allele and EBV-dependent expression of CD44. Allelic qPCR of CD44 expression in EBV-infected and EBV negative Ramos B cells (see key). Fold-change in expression is provided relative to the C allele. Error bars represent standard deviation (n=12: three independent experiments of technical quadruplicates). P-values were calculated using a two-way ANOVA with a Tukey post-hoc test. EBV status and variant genotype were used as the two factors.
Figure 4
Figure 4. Cell types and TFs at disease-associated loci
a. SLE variants significantly intersect H3K27ac-marked regions in EBV-infected B cells. H3K27ac ChIP-seq peaks were collected from 175 different cell lines and types. The Y-axis indicates the negative log of the RELI P-value for the intersection of SLE-associated variants with H3K27ac peaks in each dataset. b. SLE variants intersect active chromatin regions in EBV-infected B cells. Same as (a), but instead using “active chromatin” regions, which are based on combinations of histone marks. c. Global view of RELI results – all diseases against all TFs. Columns and rows show the 94 phenotypes/diseases and 212 TFs with at least one significant (Pc<10−6) RELI result. Color indicates negative log of the RELI P-value (see key). Disease abbreviations are provided in the main text. d. TFs intersecting breast cancer loci. Intersection between disease loci with TF-bound DNA sequences, as in Figure 1. However, here the cluster of TFs and risk loci instead largely may operate in ductal epithelial cells, independently of EBNA2. The top 20 TFs are shown - full results are provided in Supplementary Data Set 3.

Comment in

References

    1. Fujinami RS, von Herrath MG, Christen U, Whitton JL. Molecular mimicry, bystander activation, or viral persistence: infections and autoimmune disease. Clin Microbiol Rev. 2006;19:80–94. doi: 10.1128/CMR.19.1.80-94.2006. - DOI - PMC - PubMed
    1. Ercolini AM, Miller SD. The role of infections in autoimmune disease. Clinical and experimental immunology. 2009;155:1–15. doi: 10.1111/j.1365-2249.2008.03834.x. - DOI - PMC - PubMed
    1. Sener AG, Afsar I. Infection and autoimmune disease. Rheumatol Int. 2012;32:3331–3338. doi: 10.1007/s00296-012-2451-z. - DOI - PubMed
    1. James JA, et al. An increased prevalence of Epstein-Barr virus infection in young patients suggests a possible etiology for systemic lupus erythematosus. J Clin Invest. 1997;100:3019–3026. doi: 10.1172/JCI119856. - DOI - PMC - PubMed
    1. Hanlon P, Avenell A, Aucott L, Vickers MA. Systematic review and meta-analysis of the sero-epidemiological association between Epstein-Barr virus and systemic lupus erythematosus. Arthritis research & therapy. 2014;16:R3. doi: 10.1186/ar4429. - DOI - PMC - PubMed

Publication types

Substances