Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct 1;107(4):670-682.
doi: 10.1016/j.ajhg.2020.08.016. Epub 2020 Sep 9.

Unsupervised Clustering of Missense Variants in HNF1A Using Multidimensional Functional Data Aids Clinical Interpretation

Affiliations

Unsupervised Clustering of Missense Variants in HNF1A Using Multidimensional Functional Data Aids Clinical Interpretation

Sara Althari et al. Am J Hum Genet. .

Abstract

Exome sequencing in diabetes presents a diagnostic challenge because depending on frequency, functional impact, and genomic and environmental contexts, HNF1A variants can cause maturity-onset diabetes of the young (MODY), increase type 2 diabetes risk, or be benign. A correct diagnosis matters as it informs on treatment, progression, and family risk. We describe a multi-dimensional functional dataset of 73 HNF1A missense variants identified in exomes of 12,940 individuals. Our aim was to develop an analytical framework for stratifying variants along the HNF1A phenotypic continuum to facilitate diagnostic interpretation. HNF1A variant function was determined by four different molecular assays. Structure of the multi-dimensional dataset was explored using principal component analysis, k-means, and hierarchical clustering. Weights for tissue-specific isoform expression and functional domain were integrated. Functionally annotated variant subgroups were used to re-evaluate genetic diagnoses in national MODY diagnostic registries. HNF1A variants demonstrated a range of behaviors across the assays. The structure of the multi-parametric data was shaped primarily by transactivation. Using unsupervised learning methods, we obtained high-resolution functional clusters of the variants that separated known causal MODY variants from benign and type 2 diabetes risk variants and led to reclassification of 4% and 9% of HNF1A variants identified in the UK and Norway MODY diagnostic registries, respectively. Our proof-of-principle analyses facilitated informative stratification of HNF1A variants along the continuum, allowing improved evaluation of clinical significance, management, and precision medicine in diabetes clinics. Transcriptional activity appears a superior readout supporting pursuit of transactivation-centric experimental designs for high-throughput functional screens.

Keywords: HNF1A; bioinformatics; cluster analysis; diabetes; genetics; monogenic diabetes; protein function; rare variants; type 2 diabetes.

PubMed Disclaimer

Conflict of interest statement

The views expressed in this article are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. M.I.M. has served on advisory panels for Pfizer, NovoNordisk, and Zoe Global, received honoraria from Merck, Pfizer, NovoNordisk, and Eli Lilly, and received research funding from Abbvie, Astra Zeneca, Boehringer Ingelheim, Eli Lilly, Janssen, Merck, NovoNordisk, Pfizer, Roche, Sanofi Aventis, Servier, and Takeda. As of June 2019, M.I.M. is an employee of Genentech and a holder of Roche stock. A.L.G. reports grants from Wellcome Trust, grants from NIHR Oxford Biomedical Research Centre, grants from Horizon 2020, grants from NIDDK, and grants from MRC during the conduct of the study and personal fees from NovoNordisk and Merck outside the submitted work.

Figures

Figure 1
Figure 1
Eigendecomposition of Principal Components Explaining > 85% of Variance Shown are (A) Oxford and (B) Bergen datasets. TA INS1_P2 and TA HeLa_ALB are transcriptional activity data from INS-1 cells using HNF4A P2 promoter and from HeLa cells using rat albumin promoter, respectively. PE, protein expression; nuc loc, nuclear localization data.
Figure 2
Figure 2
K-Means Clustering HNF1A missense alleles characterized at Oxford (A) and Bergen (B) in principal component (PC) space. Blue and green k clusters represent alleles with benign and benign-to-intermediate effects on function, respectively; purple k clusters represent alleles with intermediate functional impact; red k clusters indicate intermediate-to-damaging or functionally damaging alleles.
Figure 3
Figure 3
Hierarchical Clustering Analysis HNF1A missense alleles characterized at Oxford (A) and Bergen (B). WARD minimum variance method was used and analysis performed using orthogonally transformed functional data from PC1-PC4 (>85% explained variance) from Oxford dataset and PC1-PC5 (>85% explained variance) form Bergen dataset. To optimize visualization of the function phenotype gradient, some branches were rotated. The numbers of the y axes of (A) and (B) refer to clustering height calculated as by Ward's criterion (total within-cluster variance).
Figure 4
Figure 4
Distribution of Functionally Annotated HNF1A Missense Alleles (A and B) As a function of frequency in the (A) UK MODY diagnostic registry and (B) Norway MODY diagnostic registry on the x axis and reported frequency in the genome aggregation database (gnomAD) on the y axis. Alleles are colored on the basis of the (re)classification scheme on the top right. (C) Frequency of functionally characterized exome-detected HNF1A missense alleles in gnomAD. The red and orange dashed lines mark known ultra-rare, MODY pathogenic (allele count ≤ 2, AF < 0.0008%) and low frequency type 2 diabetes predisposing allele frequencies (allele count ≤ 121, AF < 0.04%) respectively.

Similar articles

Cited by

References

    1. Murphy R., Ellard S., Hattersley A.T. Clinical implications of a molecular genetic classification of monogenic beta-cell diabetes. Nat. Clin. Pract. Endocrinol. Metab. 2008;4:200–213. - PubMed
    1. Najmi L.A., Aukrust I., Flannick J., Molnes J., Burtt N., Molven A., Groop L., Altshuler D., Johansson S., Bjørkhaug L., Njølstad P.R. Functional investigations of HNF1A identify rare variants as risk factors for Type 2 diabetes in the general population. Diabetes. 2017;66:335–346. - PMC - PubMed
    1. Estrada K., Aukrust I., Bjørkhaug L., Burtt N.P., Mercader J.M., García-Ortiz H., Huerta-Chagoya A., Moreno-Macías H., Walford G., Flannick J., SIGMA Type 2 Diabetes Consortium Association of a low-frequency variant in HNF1A with type 2 diabetes in a Latino population. JAMA. 2014;311:2305–2314. - PMC - PubMed
    1. Balamurugan K., Bjørkhaug L., Mahajan S., Kanthimathi S., Njølstad P.R., Srinivasan N., Mohan V., Radha V. Structure-function studies of HNF1A (MODY3) gene mutations in South Indian patients with monogenic diabetes. Clin. Genet. 2016;90:486–495. - PubMed
    1. Flannick J., Florez J.C. Type 2 diabetes: genetic data sharing to advance complex disease research. Nat. Rev. Genet. 2016;17:535–549. - PubMed

Publication types

MeSH terms

Substances

Supplementary concepts