Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2024 Nov 28:2024.11.26.24317923.
doi: 10.1101/2024.11.26.24317923.

BMI Interacts with the Genome to Regulate Gene Expression Globally, with Emphasis in the Brain and Gut

Affiliations

BMI Interacts with the Genome to Regulate Gene Expression Globally, with Emphasis in the Brain and Gut

Rebecca Signer et al. medRxiv. .

Abstract

Genome-wide association studies identify common genomic variants associated with disease across a population. Individual environmental effects are often not included, despite evidence that environment mediates genomic regulation of higher order biology. Body mass index (BMI) is associated with complex disorders across clinical specialties, yet has not been modeled as a genomic environment. Here, we tested for expression quantitative trait (eQTL) loci that contextually regulate gene expression across the BMI spectrum using an interaction approach. We parsed the impact of cell type, enhancer interactions, and created novel BMI-dynamic gene expression predictor models. We found that BMI main effects associated with endocrine gene expression, while interactive variant-by-BMI effects impacted gene expression in the brain and gut. Cortical BMI-dynamic loci were experimentally dysregulated by inflammatory cytokines in an in vitro system. Using BMI-dynamic models, we identify novel genes in nitric oxide signaling pathways in the nucleus accumbens significantly associated with depression and smoking. While neither genetics nor BMI are sufficient as standalone measures to capture the complexity of downstream cellular consequences, including environment powers disease gene discovery.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Identification of Associations of SNP and BMI on Gene Expression in the GTEx post-mortem dataset.
A) Quality control and linear model equations implemented to identify associations, created in Biorender B) Number of locally significant and genome-wide FDR < 0.01 in this Base-eGene analysis compared to GTEx v8 cis-eQTL summary statistics C) Differential expression analysis with colored FDR significant BMI-DE genes, red indicates gene’s expression is negatively associated with BMI, and Blue indicates gene expression is positively associated with BMI, and NS indicates the gene is not FDR significant. D) Number of genes in each tissue that have an FDR < 0.05 significant association with BMI from differential expression analysis, colored according to the primary body system of the tissue. E) Number of BMI-eGenes that are locally significant and FDR < 0.01 across tissues, subset by the number of samples in each tissue where >= 500 samples constitutes large, less than 500 but greater than or equal to 200 constitutes medium, and less than 200 samples constitutes small tissues F) Distribution of significant BMI-eGenes in the Brain: CAU: Caudate, basal ganglia; CNG: Anterior cingulate cortex; PUT: Putamen, basal ganglia; MFC: Cortex; HTH: Hypothalamus; PIT: Pituitary; AMY: Amygdala; HIP: Hippocampus; SN: Substantia Nigra; CB: Cerebellum
Figure 2.
Figure 2.. Function and Directionality of BMI-eQTL and eGenes.
A) Within each model (Base-eQTL, BMI-eQTL, and BMI-DE genes) The proportion of genes with a high probability of loss-of-function intolerance (pLI) score and enrichment of high pLI genes compared to the background, * indicates p-binomial < 0.05, ** indicated p-binomial < 0.01, *** indicates p-binomial < 0.001 B) FDR Significant gene sets enriched in tissues not including Adipose and tissues with N-BMI-eGenes < 20 C-F) Examples of directionality of eQTL showing the data in the significant interaction model equation (top) and the categorical models (bottom), which are used to define the direction. Of note, BMI categories were separated into quintiles where m1, m2, and m3 represent the three middle quintiles. Red dashed lines indicate the threshold for low and high BMI quintiles within tissue. Light blue points indicate individuals who are homozygous reference, dark blue points are heterozygous alternate, and purple are homozygous alternate C) Example of a positive BMI-eQTL: chr12_55900153_A_G_b38 regulating IKZF4 in Transverse Colon D) Example of a negative BMI-eQTL: chr4_5281524_A_G_b38 regulating CRMP1 in Cortex E) Example of a flipped BMI-eQTL: chr9_111930711_G_A_b38 regulating LPAR1 in Adipose, Subcutaneous F) Example of uncertain interaction: chr17_32888312_G_T_b38 regulating PSMD11 in Minor Salivary Gland G) The directionality category that is the most numerous within tissue
Figure 3.
Figure 3.. Parsing the Impact of Sequence and Cell Type on BMI-dynamic Functions.
A) Enrichment for predicted transcription factor affinity binding changed in BMI-eQTL credible sets compared to Base-eQTL credible sets B) Methodologic summary of cell type deconvolution and cell-type specific eQTL calling in GTEx bulk mRNA sequencing data created in Biorender C) Spearman Correlation of predicted cell type proportion and BMI within each tissue examined, labeled are those with a significant p< 0.05 correlation with BMI D) FDR significant gene set enrichment in genes significant after cell types are corrected and within cell-type in small intestine and transverse colon, intersection size > 1 E) Cell types with significant enrichment amongst BMI-eQTL lead SNPs in credible sets compared to Base-eQTL
Figure 4.
Figure 4.. Functional validation of Cortex BMI-eQTL in iPSC Glutamatergic Neurons.
A) Lead cell types of top SNPs in Base-eGenes (Left), and BMI-eGenes (Right), excitatory neurons are starred to indicate these credible sets are prioritized for modeling in iPSC system. B) Genes with evidence of activity by contact in at least one context (ABC >= 0.1) and evidence of a shift in activity compared to vehicle (|ABC-Context – ABC-Vehicle| >= 0.02) C) Amongst SNPs with an ABC-shift, SNPs that have significant transcription factor affinity binding score D) The locus of the SNP rs6927697 with the BMI-eQTL statistic for gene SLC16A10, ABC score, FDR-significant MZF1 binding prediction on chromosome 6.
Figure 5.
Figure 5.. Model Creation to Predict Gene Expression using BMI, SNPs, and BMI-SNP Interactions
A) Graphical representation of models built with main and interactive effects created in Biorender B) The number of BMI-dynamic models created using glinternet compared to the number of models in the published PredictDB GREx data C) The difference in cross fold R-squared from BMI-dynamic models compared to the current published PredictDB GREx models, colored by the types of predictors selected and annotated with the p-value of the t-test. Of note, the glinternet package forces the main effects to be included if the interaction is selected. D) In the BMI-dynamic models, the number of models created that select BMI as a main effect for each tissue, separated by if BMI is the sole predictor in that gene’s model, if SNPs are present as well, or if a BMI-SNP interaction is present
Figure 6.
Figure 6.. Gene Discovery in Psychiatric Traits using Machine Learning
A) Graphical Representation of association analysis completed in UK Biobank to identify gene-tissues associated with psychiatric traits created in Biorender B) The number of genes with predicted expression significantly associated with each trait in the UK biobank using both the published PredictDB GREx models and the BMI-dynamic models C) Gene set enrichment of genes significant in each tissue-trait association compared to the background of genes expressed in that tissue D) For genes not identified in the PredictDB-PrediXcan, the distribution of predictor types in each significant model E) S-predixcan association statistics from 16 GWAS of complex traits for genes significant in both the BMI-eGene analysis and the UKB association analysis are subset in cortex. X axes are labeled with trait abbreviation and year of the GWAS. Trait abbreviations are as follows: AN_2019=anorexia nervosa; MD_2024=Major depression, 2024; ALS_2021=amyotrophic lateral sclerosis; BIP_2021=bipolar disorder- all; ADHD_2019=Attention-deficit/hyperactivity disorder; BMI_2018=Body mass index; T2D_2022=Type 2 diabetes; TS_2019=Tourette Syndrome; ALZ_2021=Alzheimer; Menarche_2017=Age at Menarche; Menopause_2021=Age at Menopause

References

    1. Reza N., Alford R. L., Belmont J. W. & Marston N. The Expansion of Genetic Testing in Cardiovascular Medicine: Preparing the Cardiology Community for the Changing Landscape. Curr. Cardiol. Rep. 26, 135–146 (2024). - PMC - PubMed
    1. Dratch L. et al. Genetic testing in adults with neurologic disorders: indications, approach, and clinical impacts. J. Neurol. 271, 733–747 (2024). - PMC - PubMed
    1. Hingorani A. D. et al. Performance of polygenic risk scores in screening, prediction, and risk stratification: secondary analysis of data in the Polygenic Score Catalog. BMJ Med. 2, e000554 (2023). - PMC - PubMed
    1. Johansson Å. et al. Precision medicine in complex diseases-Molecular subgrouping for improved prediction and treatment stratification. J. Intern. Med. 294, 378–396 (2023). - PMC - PubMed
    1. Gibson G. Population genetics and GWAS: A primer. PLoS Biol. 16, e2005485 (2018). - PMC - PubMed

Publication types

LinkOut - more resources