Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Apr 12;544(7649):235-239.
doi: 10.1038/nature22034.

Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity

Affiliations

Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity

Danish Saleheen et al. Nature. .

Abstract

A major goal of biomedicine is to understand the function of every gene in the human genome. Loss-of-function mutations can disrupt both copies of a given gene in humans and phenotypic analysis of such 'human knockouts' can provide insight into gene function. Consanguineous unions are more likely to result in offspring carrying homozygous loss-of-function mutations. In Pakistan, consanguinity rates are notably high. Here we sequence the protein-coding regions of 10,503 adult participants in the Pakistan Risk of Myocardial Infarction Study (PROMIS), designed to understand the determinants of cardiometabolic diseases in individuals from South Asia. We identified individuals carrying homozygous predicted loss-of-function (pLoF) mutations, and performed phenotypic analysis involving more than 200 biochemical and disease traits. We enumerated 49,138 rare (<1% minor allele frequency) pLoF mutations. These pLoF mutations are estimated to knock out 1,317 genes, each in at least one participant. Homozygosity for pLoF mutations at PLA2G7 was associated with absent enzymatic activity of soluble lipoprotein-associated phospholipase A2; at CYP2F1, with higher plasma interleukin-8 concentrations; at TREH, with lower concentrations of apoB-containing lipoprotein subfractions; at either A3GALT2 or NRG4, with markedly reduced plasma insulin C-peptide concentrations; and at SLC9A3R1, with mediators of calcium and phosphate signalling. Heterozygous deficiency of APOC3 has been shown to protect against coronary heart disease; we identified APOC3 homozygous pLoF carriers in our cohort. We recruited these human knockouts and challenged them with an oral fat load. Compared with family members lacking the mutation, individuals with APOC3 knocked out displayed marked blunting of the usual post-prandial rise in plasma triglycerides. Overall, these observations provide a roadmap for a 'human knockout project', a systematic effort to understand the phenotypic consequences of complete disruption of genes in humans.

PubMed Disclaimer

Conflict of interest statement

The authors do not declare competing financial interests.

Figures

Extended Data Fig. 1
Extended Data Fig. 1. pLoF mutations are typically seen in very few individuals
The site-frequency spectrum of synonymous, missense, and high-confidence pLoF mutations is represented. Points represent the proportion of variants within a 1 × 10−4 minor allele frequency bin for each variant category. Lines represent the cumulative proportions of variants categories. The bottom inset highlights that most pLoF variants are often seen in no more than one or two individuals. The top inset highlights that virtually all pLoF mutations are very rare.
Extended Data Fig. 2
Extended Data Fig. 2. Intersection of homozygous pLoF genes between PROMIS and other cohorts
We compared the counts and overlap of unique homozygous pLoF genes in PROMIS with other exome sequenced cohorts.
Extended Data Fig. 3
Extended Data Fig. 3. QQ-plot of recessive model pLoF association analysis across phenotypes
Analyses to determine whether homozygous pLoF carrier status was associated with traits was performed where there were at least two homozygous pLoF carriers phenotyped per trait. The observed versus the expected results from 15,263 associations are displayed here demonstrating an excess of associations beyond a Bonferroni threshold.
Extended Data Fig. 4
Extended Data Fig. 4. Carriers of pLoF alleles in CYP2F1 have increased interleukin-8 concentrations
Participants who had pLoF mutations in the CYP2F1 gene had higher concentrations of interleukin-8 while heterozygotes had a more modest effect when compared to the rest of the cohort of non-carriers. Interleukin 8 concentration is natural log transformed.
Extended Data Fig. 5
Extended Data Fig. 5. Carriers of pLoF alleles in TREH have decreased concentrations of several lipoprotein subfractions
Participants who had pLoF mutations in the TREH gene had lower concentrations of several lipoprotein subfractions.
Extended Data Fig. 6
Extended Data Fig. 6. Nondiabetic homozygous pLoF carriers for A3GALT2 have diminished insulin C-peptide concentrations
Among nondiabetics, those who were homozygous pLoF for A3GALT2 had substantially lower fasting insulin C-peptide concentrations. This observation was not evident in nondiabetic heterozygous pLoF A3GALT2 participants. Insulin C-peptide is natural log transformed.
Extended Data Fig. 7
Extended Data Fig. 7. Example of a second polymorphism in-phase which rescues a putative protein-truncating mutation
Short-reads aligning to genomic positions 65,339,112 to 65,339,132 on chromosome 1 are displayed for one individual with a putative homozygous pLoF genotype in this region. The single nucleotide polymorphism at position 65,339,122 from G to T is annotated as a nonsense mutation in the JAK1 gene. However, all three homozygotes of this mutation carried a tandem single nucleotide polymorphism in the same codon (A to G at 65,339,124) thus resulting in a glutamine and effectively rescuing the protein-truncating mutation.
Extended Data Fig. 8
Extended Data Fig. 8. Anticipated number of genes knocked out with increasing sample sizes by minimum knockout count
We simulate the number of genes expected to be knocked out by minimum knockout count per gene at increasing sample sizes. We perform this simulation with and without the observed inbreeding.
Extended Data Fig. 9
Extended Data Fig. 9. PROMIS participants have an excess burden of runs of homozygosity compared with other populations
Consanguinity leads to regions of genomic segments that are identical by descent and can be observed as runs of homozygosity. Using genome-wide array data in 17,744 PROMIS participants and reference samples from the International HapMap 3, the burden of runs of homozygosity (minimum 1.5 Mb) per individual was derived and population-specific distributions are displayed, with outliers removed. This highlights the higher median runs of homozygosity burden in PROMIS and the higher proportion of individuals with very high burdens.
Extended Data Fig. 10
Extended Data Fig. 10. Down-sampling of synonymous and high confidence pLoF variants to validate simulation
We ran simulations to estimate the number of unique, completely knocked out genes at increasing sample sizes. Prior to applying our model, we first applied this approach to a range of sample sizes below 7,078 for variants that were not under constraint, synonymous variants (a.), and for high-confidence null variants (b.). At the observed sample size, we did not appreciate significant selection. We expect that at increasing sample sizes, there may be a subset of genes that will not be tolerated in a homozygous pLoF state. In fact, our estimates are slightly more conservative when comparing outbred simulations with a recent description of >100,000 Icelanders using a more liberal definition for pLoF mutations.
Fig 1
Fig 1. Homozygous pLoF burden in PROMIS is driven by excess autozygosity
a, Most genes are observed in the homozygous pLoF state in only single individuals. b. The distribution of F inbreeding coefficient of PROMIS participants is compared to those of outbred samples of African (AFR) and European (EUR) ancestry. c, The burden of homozygous pLoF genes per individual is correlated with coefficient of inbreeding.
Fig 2
Fig 2. Carriers of PLA2G7 splice mutation have diminished Lp-PLA2 mass and activity but similar risk for coronary heart disease risk when compared to non-carriers
a.–b. Carriage of a splice-site mutation, c.663+1G>A, in PLA2G7 leads to a dose-dependent reduction of both lipoprotein-associated phospholipase A2 (Lp-PLA2) mass (P = 6 × 10−5) and activity (P = 2 × 10−7), with homozygotes having no circulating Lp-PLA2. c. Despite substantial reductions of Lp-PLA2 activity, PLA2G7 c.663+1G>A heterozygotes and homozygotes have similar coronary heart disease risk when compared with non-carriers (P = 0.87).
Fig 3
Fig 3. APOC3 pLoF homozygotes have diminished fasting triglycerides and blunted post-prandial lipemia
a.–d. APOC3 pLoF genotype status, apolipoprotein C-III, triglycerides, HDL cholesterol and LDL cholesterol distributions among all sequenced participants. Apolipoprotein C-III concentration is displayed on a logarithmic base 10 scale. e. A proband with APOC3 pLoF homozygote genotype as well as several family members were recalled for provocative phenotyping. Surprisingly, the spouse of the proband was also a pLoF homozygote, leading to nine obligate homozygote children. Given the extensive number of first-degree unions, the pedigree is simplified for clarity. f. APOC3 p.Arg19Ter homozygotes and non-carriers within the same family were challenged with a 50 g/m2 fat feeding. Homozygotes had lower baseline triglyceride concentrations and displayed marked blunting of post-prandial rise in plasma triglycerides.
Fig 4
Fig 4. Simulations anticipate many more homozygous pLoF genes in the PROMIS cohort
Number of unique homozygous pLoF genes anticipated with increasing sample sizes sequenced in PROMIS compared with similar African (AFR) and European (EUR) sample sizes. Estimates derived using observed allele frequencies and degree of inbreeding.

Comment in

References

    1. Eisenberg D, Marcotte EM, Xenarios I, Yeates TO. Protein Function in the Post-Genomic Era. Nature. 2000;405:823–826. doi: 10.1038/35015694. - DOI - PubMed
    1. Bittles AH, Mason WM, Greene J, Rao NA. Reproductive Behavior and Health in Consanguineous Marriages. Science. 1991;252:789–794. - PubMed
    1. Saleheen D, et al. The Pakistan Risk of Myocardial Infarction Study: A Resource for the Study of Genetic, Lifestyle and Other Determinants of Myocardial Infarction in South Asia. Eur J Epidemiol. 2009;24:329–338. doi: 10.1007/s10654-009-9334-y. - DOI - PMC - PubMed
    1. Crosby J, et al. Loss-of-Function Mutations in APOC3, Triglycerides, and Coronary Disease. N Engl J Med. 2014;371:22–31. doi: 10.1056/NEJMoa1307095. - DOI - PMC - PubMed
    1. Jorgensen AB, Frikke-Schmidt R, Nordestgaard BG, Tybjaerg-Hansen A. Loss-of-Function Mutations in Apoc3 and Risk of Ischemic Vascular Disease. N Engl J Med. 2014;371:32–41. doi: 10.1056/NEJMoa1308027. - DOI - PubMed

Publication types

MeSH terms