Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Apr 24;9(1):1612.
doi: 10.1038/s41467-018-03910-9.

Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study

Affiliations

Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study

Christopher DeBoever et al. Nat Commun. .

Abstract

Protein-truncating variants can have profound effects on gene function and are critical for clinical genome interpretation and generating therapeutic hypotheses, but their relevance to medical phenotypes has not been systematically assessed. Here, we characterize the effect of 18,228 protein-truncating variants across 135 phenotypes from the UK Biobank and find 27 associations between medical phenotypes and protein-truncating variants in genes outside the major histocompatibility complex. We perform phenome-wide analyses and directly measure the effect in homozygous carriers, commonly referred to as "human knockouts," across medical phenotypes for genes implicated as being protective against disease or associated with at least one phenotype in our study. We find several genes with strong pleiotropic or non-additive effects. Our results illustrate the importance of protein-truncating variants in a variety of diseases.

PubMed Disclaimer

Conflict of interest statement

C.D.B. is a member of the scientific advisory boards for Liberty Biosecurity, Personalis, 23andMe Roots into the Future, Ancestry.com, IdentifyGenomics, and Etalon and is a founder of CDB Consulting. M.J.D. is a member the scientific advisory board for Ancestry.com. M.A.R. is a paid consultant for Genomics PLC and Prime Genomics. E.I. is a scientific advisor for Precision Wellness and Olink Proteomics for work unrelated to the present project. The remaining authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Schematic overview of the study. We prepared a data set of 18,228 protein-truncating variants and 135 medical phenotypes from the UK Biobank data set of 337,205 individuals. From these data, we analyzed the clinical effects of predicted protein-truncating genetic variants
Fig. 2
Fig. 2
Identification of risk and protective alleles for 135 phenotypes. a–c Manhattan plots for logistic regression for all PTVs and all phenotypes stratified by minor allele frequency a > 1%, b between 0.1% and 1%, and c between 0.01% and 0.1%. Scatter points are colored according to phenotype. Fourteen associations with -log10 p values> 20 were plotted at 20. PTVs in genes near or in the MHC region have smaller scatter points. d Effect size “cascade plot” for all associations outside the MHC with BY-adjusted p < 0.05. Error bars represent 95% confidence intervals. e–f Manhattan plots for PTVs in or near the MHC with minor allele frequency e > 1% and f between 0.1 and 1%. The p values for gray points are the same as in a and b, respectively. The p values for the color points have been re-calculated conditional on HLA alleles
Fig. 3
Fig. 3
PheWAS for IFIH1. Phenome-wide associations (logistic regression, p < 0.01) for four PTVs in IFIH1 with minor allele frequency > 0.01%. The left panel shows the number of cases per phenotype in thousands. The middle panel shows the logistic regression −log10 p values. The right panel shows the estimated odds ratios and 95% confidence intervals
Fig. 4
Fig. 4
Non-additive associations for FUT2. Association results under additive and non-additive logistic regression models for predicted FUT2 heterozygous or homozygous knockouts (KOs) with a difference between non-additive model AIC and additive model AIC < −1. The left panel shows the number of cases per phenotype. The middle-left panel shows the -log10 value for the KO association analysis. The middle-right panel shows the estimated log odds ratios and 95% confidence intervals under an additive model (orange) and under a non-additive model for heterozygote KOs (blue) and homozygote KOs (green)

References

    1. Rivas MA, et al. A protein-truncating R179X variant in RNF186 confers protection against ulcerative colitis. Nat. Commun. 2016;7:12342. doi: 10.1038/ncomms12342. - DOI - PMC - PubMed
    1. Rivas MA, et al. Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease. Nat. Genet. 2011;43:1066–1073. doi: 10.1038/ng.952. - DOI - PMC - PubMed
    1. Cohen J, et al. Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9. Nat. Genet. 2005;37:161–165. doi: 10.1038/ng1509. - DOI - PubMed
    1. Cohen JC, Boerwinkle E, Mosley THJ, Hobbs HH. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N. Engl. J. Med. 2006;354:1264–1272. doi: 10.1056/NEJMoa054013. - DOI - PubMed
    1. Saleheen, D. et al. Human knockouts in a cohort with a high rate of consanguinity. Nature544, 235–239 (2015). - PMC - PubMed

Publication types