This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2024 Jul 15:2024.07.15.24310282.

doi: 10.1101/2024.07.15.24310282.

Multi-omics characterization of type 2 diabetes associated genetic variation

Ravi Mandla^{1

2

3}, Kim Lorenz^{4

5

6

7}, Xianyong Yin^{8

9}, Ozvan Bocher¹⁰, Alicia Huerta-Chagoya^{1

3}, Ana Luiza Arruda^{10

11}, Anthony Piron^{12

13

14

15}, Susanne Horn¹⁰, Ken Suzuki^{16

17

18}, Konstantinos Hatzikotoulas¹⁰, Lorraine Southam¹⁰, Henry Taylor^{19

20

21}, Kaiyuan Yang^{22

23}, Karin Hrovatin^{24

25}, Yue Tong¹², Maria Lytrivi^{12

26}, Nigel W Rayner¹⁰, James B Meigs^{1

27

28}, Mark I McCarthy^{29

30

31}, Anubha Mahajan^{29

30}, Miriam S Udler^{1

2

3

27}, Cassandra N Spracklen³², Michael Boehnke⁹, Marijana Vujkovic^{4

33

34}, Jerome I Rotter³⁵, Decio L Eizirik¹², Miriam Cnop^{12

26

36}, Heiko Lickert^{22

23

37}, Andrew P Morris^{16

10

38}, Eleftheria Zeggini^{10

39}, Benjamin F Voight^{4

5

6

7}, Josep M Mercader^{1

2

3

28}

Affiliations

¹ Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
² Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.
³ Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
⁴ Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA.
⁵ Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
⁶ Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
⁷ Institute for Translational Medicine and Therapeutics, University of Pennsylvania - Perelman School of Medicine, Philadelphia PA.
⁸ Department of Epidemiology, School of Public Health, Nanjing Medical University, Nanjing, China.
⁹ Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA.
¹⁰ Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.
¹¹ Graduate School of Experimental Medicine, Technical University of Munich, Munich, Germany.
¹² ULB Center for Diabetes Research, Medical Faculty, Université Libre de Bruxelles, Brussels, Belgium.
¹³ Interuniversity Institute of Bioinformatics in Brussels (IB2), Brussels, Belgium.
¹⁴ Machine Learning Group, Université Libre de Bruxelles, Brussels, Belgium.
¹⁵ Diabetes and Inflammation Laboratory, Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom.
¹⁶ Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological Sciences, University of Manchester, Manchester, UK.
¹⁷ Department of Diabetes and Metabolic Diseases, Graduate School of Medicine, University of Tokyo, Tokyo, Japan.
¹⁸ Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan.
¹⁹ Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
²⁰ British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
²¹ Heart and Lung Research Institute, University of Cambridge, Cambridge, UK.
²² Institute of Diabetes and Regeneration Research (IDR), Helmholtz Munich, Neuherberg, Germany.
²³ German Center for Diabetes Research (DZD), Neuherberg, Germany.
²⁴ Institute of Computational Biology (ICB), Helmholtz Munich, Neuherberg, Germany.
²⁵ School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
²⁶ Division of Endocrinology, Erasmus Hospital, Universite Libre de Bruxelles, Brussels, Belgium.
²⁷ Department of Medicine, Harvard Medical School, Boston, MA, USA.
²⁸ Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA.
²⁹ Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK.
³⁰ Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, UK.
³¹ Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
³² Department of Biostatistics and Epidemiology, University of Massachusetts Amherst, Amherst, MA, USA.
³³ Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
³⁴ Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
³⁵ Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA.
³⁶ WEL Research Institute, Wavre, Belgium.
³⁷ School of Medicine, Technical University of Munich, Munich, Germany.
³⁸ Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia.
³⁹ TUM School of Medicine and Health, Technical University of Munich and Klinikum Rechts der Isar, Munich, Germany.

PMID: 39072045
PMCID: PMC11275663
DOI: 10.1101/2024.07.15.24310282

Multi-omics characterization of type 2 diabetes associated genetic variation

Ravi Mandla et al. medRxiv. 2024.

[Preprint]. 2024 Jul 15:2024.07.15.24310282.

doi: 10.1101/2024.07.15.24310282.

Authors

Affiliations

¹ Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
² Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA.
³ Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
⁴ Corporal Michael J. Crescenz VA Medical Center, Philadelphia, PA, USA.
⁵ Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
⁶ Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
⁷ Institute for Translational Medicine and Therapeutics, University of Pennsylvania - Perelman School of Medicine, Philadelphia PA.
⁸ Department of Epidemiology, School of Public Health, Nanjing Medical University, Nanjing, China.
⁹ Department of Biostatistics and Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, USA.
¹⁰ Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany.
¹¹ Graduate School of Experimental Medicine, Technical University of Munich, Munich, Germany.
¹² ULB Center for Diabetes Research, Medical Faculty, Université Libre de Bruxelles, Brussels, Belgium.
¹³ Interuniversity Institute of Bioinformatics in Brussels (IB2), Brussels, Belgium.
¹⁴ Machine Learning Group, Université Libre de Bruxelles, Brussels, Belgium.
¹⁵ Diabetes and Inflammation Laboratory, Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom.
¹⁶ Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Division of Musculoskeletal and Dermatological Sciences, University of Manchester, Manchester, UK.
¹⁷ Department of Diabetes and Metabolic Diseases, Graduate School of Medicine, University of Tokyo, Tokyo, Japan.
¹⁸ Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan.
¹⁹ Center for Precision Health Research, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA.
²⁰ British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
²¹ Heart and Lung Research Institute, University of Cambridge, Cambridge, UK.
²² Institute of Diabetes and Regeneration Research (IDR), Helmholtz Munich, Neuherberg, Germany.
²³ German Center for Diabetes Research (DZD), Neuherberg, Germany.
²⁴ Institute of Computational Biology (ICB), Helmholtz Munich, Neuherberg, Germany.
²⁵ School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
²⁶ Division of Endocrinology, Erasmus Hospital, Universite Libre de Bruxelles, Brussels, Belgium.
²⁷ Department of Medicine, Harvard Medical School, Boston, MA, USA.
²⁸ Division of General Internal Medicine, Massachusetts General Hospital, Boston, MA, USA.
²⁹ Wellcome Centre for Human Genetics, Nuffield Department of Medicine, University of Oxford, Oxford, UK.
³⁰ Oxford Centre for Diabetes, Endocrinology and Metabolism, Radcliffe Department of Medicine, University of Oxford, Oxford, UK.
³¹ Oxford NIHR Biomedical Research Centre, Churchill Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK.
³² Department of Biostatistics and Epidemiology, University of Massachusetts Amherst, Amherst, MA, USA.
³³ Department of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
³⁴ Department of Biostatistics, Epidemiology and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.
³⁵ Institute for Translational Genomics and Population Sciences, Department of Pediatrics, Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA.
³⁶ WEL Research Institute, Wavre, Belgium.
³⁷ School of Medicine, Technical University of Munich, Munich, Germany.
³⁸ Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia.
³⁹ TUM School of Medicine and Health, Technical University of Munich and Klinikum Rechts der Isar, Munich, Germany.

PMID: 39072045
PMCID: PMC11275663
DOI: 10.1101/2024.07.15.24310282

Abstract

Discerning the mechanisms driving type 2 diabetes (T2D) pathophysiology from genome-wide association studies (GWAS) remains a challenge. To this end, we integrated omics information from 16 multi-tissue and multi-ancestry expression, protein, and metabolite quantitative trait loci (QTL) studies and 46 multi-ancestry GWAS for T2D-related traits with the largest, most ancestrally diverse T2D GWAS to date. Of the 1,289 T2D GWAS index variants, 716 (56%) demonstrated strong evidence of colocalization with a molecular or T2D-related trait, implicating 657 cis-effector genes, 1,691 distal-effector genes, 731 metabolites, and 43 T2D-related traits. We identified 773 of these cis- and distal-effector genes using either expression QTL data from understudied ancestry groups or inclusion of T2D index variants enriched in underrepresented populations, emphasizing the value of increasing population diversity in functional mapping. Linking these variants, genes, metabolites, and traits into a network, we elucidated mechanisms through which T2D-associated variation may impact disease risk. Finally, we showed that drugs targeting effector proteins were enriched in those approved to treat T2D, highlighting the potential of these results to prioritize drug targets for T2D. These results represent a leap in the molecular characterization of T2D-associated genetic variation and will aid in translating genetic findings into novel therapeutic strategies.

PubMed Disclaimer

Conflict of interest statement

Disclosures J.B.M is an academic associate for Quest Diagnostics Inc. Endocrine R&D. MIMcC is now an employee of Genentech and a holder of Roche stock.

Figures

**Figure 1:. Overview of Project.**
a) Genomic regions containing the 1,289 T2D-associated index variants from Suzuki et al. were tested for evidence of colocalization with 10 eQTL datasets, 4 pQTL datasets, 2 metabQTL datasets, and 46 related trait GWAS. Colocalizations were then mapped to an interactive network to visualize genes (from an eQTL or pQTL, colored green), metabolites (colored orange), or traits (colored purple) with evidence of sharing a causal variant with T2D around a index variant (colored yellow). These results were then used in downstream analyses to identify enrichment with expression datasets, better understand T2D pathways, and identify drug targets for T2D. b) Upset plot of the 716 T2D index variants mapped to an association in an eQTL, pQTL, metabQTL, or trait GWAS dataset (with a colocalization PP.H4 >0.8, Methods). c) Bar plot of the number of effector genes, metabolites, and traits identified from colocalization analyses with T2D.

**Figure 2:. Identification of putative effector genes for T2D.**
a) Plotted are the number of effector genes for T2D previously identified from colocalization analyses between various T2D GWAS and eQTL datasets. Colors indicate the tissue type of the eQTL dataset and shape indicates major self-reported population group of the eQTL dataset. Gray bars represent the total number of unique transcripts across all colocalization analyses per GWAS. b) Upset plot of the variant to gene mappings identified in each eQTL dataset analyzed. c) Proportion of colocalizations with evidence in one tissue (PP.H4 >0.8) and no positive evidence observed in other tissues (PP.H4 <0.3). d) Example of a colocalization observed only in Pancreatic Islets, for the gene *SCTR*. Colors indicate LD in EUR populations from 1000G relative to rs2244214.

**Figure 3:. pQTL colocalizations identified in multiple datasets.**
a) Upset plot of variant to gene mappings identified via colocalization analyses with four different pQTL datasets. b) Correlation of pQTL effect sizes for colocalizations identified in both UKB and deCODE pQTL datasets (Pearson R=0.93; P=9.1 × 10⁻¹⁷). c) Locus compare plot of *CBLN4* using deCODE pQTL data. d) Colocalization subnetwork of rs1415287, containing the colocalizations with CBLN4 as well as a colocalization with IGFBP-1. Green nodes represent genes from an eQTL or pQTL, orange nodes represent metabolites, purple nodes represent traits, and yellow nodes represent T2D index variants. Size of the nodes indicate the number of colocalizations observed in the full network. Gray edges represent colocalizations with a plasma/blood dataset, green edges represent colocalizations with a subcutaneous adipose eQTL dataset, and pink edges represent colocalizations with a trait. Dashed lines indicate colocalizations are in the opposite direction as T2D risk and solid lines indicate colocalizations are in the same direction as T2D risk.

**Figure 4:. eQTL from understudied populations identify novel colocalizations with T2D.**
a) Upset plot of colocalizations identified with blood eQTL datasets from four different populations. Orange bars represent colocalizations observed only in the Puerto Rican (PR), Mexican American (MX), or African American (AA) datasets (PP.H4 >0.8) and not in the European (EUR) dataset (PP.H4 <0.3). Blue bars represent colocalizations observed only in the EUR dataset (PP.H4 >0.8) and not in PR, MX, or AA datasets (PP.H4 <0.3). Gray bars represent colocalizations observed in any of the PR, MX, or AA datasets and showed a PP.H4 between 0.3 and 0.8 in the European dataset, or were observed in the European dataset but showed a PP.H4 between 0.3 and 0.8 in any of the PR, MX, or AA datasets. b) Log-fold change of allele frequencies between AMR-like and EUR-like populations for T2D index variants with a colocalization observed in one population group. c) Locus compare plots of *LIN7A* and *ACSS3* with MX data. Colors indicate LD in AMR continental ancestry from 1000G relative to variant rs10128882. d) Allele frequencies of lead colocalizing variant rs10128882 (blue) and T2D index variant rs11114650 (black) per continental ancestry and per inferred local ancestry among AMR participants from gnomAD v4.0. e) Effect sizes of rs10128882 (blue) and rs11114650 (black) in both the T2D GWAS and blood eQTL datasets, stratified by ancestry.

**Figure 5:. Phosphatidylcholine has consistent negative effect directions with T2D risk.**
a) Joint scatterplot of metabolites comparing the number of T2D-associated index variants they are mapped to compared to the change in T2D effect size per change in metabQTL effect size across the variants. Points are colored and sized by Benjamoni-Hochberg adjusted p-value. b) Increased glucose has consistent correlation with increased T2D risk at colocalizing index variants. c) Decreased phosphatidylcholine has consistent correlation with decreased T2D risk at index variants. d) Subnetwork of all colocalizations mapped to a Phosphatidylcholine-mapped T2D-associated index variant. Green nodes represent genes from an eQTL or pQTL, orange nodes represent metabolites, purple nodes represent traits, and yellow nodes represent T2D index variants. Size of the nodes indicate the number of colocalizations observed in the full network.

**Figure 6:. Type 2 Diabetes Clusters Match Trait Colocalizations.**
**a) left panel:** Color indicates the percentage of total SNPs with a colocalization in each cluster; each row was a trait used for clustering. Final rows list total number of SNPs in each cluster and the total % of SNPs in the cluster with at least one colocalization. **right panel:** Color indicates the percentage of colocalizing SNPs who have the same (red) or opposite (blue) effect direction as the linked T2D SNP, with white indicating that SNPs are split 50/50 for direction, and gray indicating there were no colocalizations between that trait and any SNPs in that cluster. Text in each square indicates the total number of SNPs in that cluster that colocalize with that trait. b) Upset plot of all colocalizing SNPs with 2 or more trait group colocalizations showing trait group overlaps, with bars colored by cluster membership. c) Upset plot showing trait group overlaps for colocalizing SNPs in the lipodystrophy cluster, with gold bars highlighting overlaps that include 4 or more trait groups (n=19 out of 27 colocalizing SNPs).

**Figure 7:. Identification of novel drug targets for T2D.**
a) Forest plot of chi2 enrichment results between drugs targeting colocalizing genes identified in at least one dataset (all), only from an eQTL, only from a pQTL, and only from a *trans*-pQTL with drugs having an approved indication of diabetes in Open Targets. b) Forest plot of enrichment results using drugs having an indication of diabetes. c) Locus compare plot of *FXYD2*, identified from a colocalization observed only with pancreatic islet eQTL data. d) Violin plot of *FXYD2* expression per rs529623 genotype. e) Violin plot of *FXYD2* expression among people with T2D and non-diabetes (ND) stratified by weight status in bulk pancreatic islet data. f) Violin plot of *FXYD2* expression from human donor single-cell pancreatic islet data stratified by cell-type. Adjusted P-values <0.001 are indicated with “**”, and adjusted P-values <0.05 with “*”. Due to the low number of cells per sample, the differential expression test was not performed for some cell types (**Methods**).

See this image and copyright information in PMC

References

1. Herman W. H. & Zimmet P. Type 2 Diabetes: An Epidemic Requiring Global Attention and Urgent Action. Diabetes Care 35, 943–944 (2012). - PMC - PubMed
1. Jaacks L. M., Siegel K. R., Gujral U. P. & Narayan K. M. V. Type 2 diabetes: A 21st century epidemic. Best Pract. Res. Clin. Endocrinol. Metab. 30, 331–343 (2016). - PubMed
1. Rusina P. V. et al. Genetic support for FDA-approved drugs over the past decade. Nat. Rev. Drug Discov. 22, 864–864 (2023). - PubMed
1. Minikel E. V., Painter J. L., Dong C. C. & Nelson M. R. Refining the impact of genetic evidence on clinical success. Nature 1–6 (2024) doi:10.1038/s41586-024-07316-0. - DOI - PMC - PubMed
1. Duffy Á. et al. Development of a human genetics-guided priority score for 19,365 genes and 399 drug indications. Nat. Genet. 56, 51–59 (2024). - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

Multi-omics characterization of type 2 diabetes associated genetic variation

Affiliations

Multi-omics characterization of type 2 diabetes associated genetic variation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources