Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Oct;33(10):2555-64.
doi: 10.1093/molbev/msw127. Epub 2016 Jun 28.

Excess of Deleterious Mutations around HLA Genes Reveals Evolutionary Cost of Balancing Selection

Affiliations

Excess of Deleterious Mutations around HLA Genes Reveals Evolutionary Cost of Balancing Selection

Tobias L Lenz et al. Mol Biol Evol. 2016 Oct.

Abstract

Deleterious mutations are expected to evolve under negative selection and are usually purged from the population. However, deleterious alleles segregate in the human population and some disease-associated variants are maintained at considerable frequencies. Here, we test the hypothesis that balancing selection may counteract purifying selection in neighboring regions and thus maintain deleterious variants at higher frequency than expected from their detrimental fitness effect. We first show in realistic simulations that balancing selection reduces the density of polymorphic sites surrounding a locus under balancing selection, but at the same time markedly increases the population frequency of the remaining variants, including even substantially deleterious alleles. To test the predictions of our simulations empirically, we then use whole-exome sequencing data from 6,500 human individuals and focus on the most established example for balancing selection in the human genome, the major histocompatibility complex (MHC). Our analysis shows an elevated frequency of putatively deleterious coding variants in nonhuman leukocyte antigen (non-HLA) genes localized in the MHC region. The mean frequency of these variants declined with physical distance from the classical HLA genes, indicating dependency on genetic linkage. These results reveal an indirect cost of the genetic diversity maintained by balancing selection, which has hitherto been perceived as mostly advantageous, and have implications both for the evolution of recombination and also for the epidemiology of various MHC-associated diseases.

Keywords: MHC/HLA.; balancing selection; deleterious variation; exome; mutation load; simulations.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Number of significant GWAS associations along the genome. The chromosomal location of significant trait associations from GWAS (N = 18,682) are shown for all autosomes. Data from NHGRI GWAS catalog.
Fig. 2.
Fig. 2.
Simulated polymorphism in regions surrounding the HLA gene under different selection scenarios. Polymorphic sites (SNPs) along the regions around an HLA gene are derived from simulations with three different selection scenarios on the HLA gene (white: no selection on HLA, blue: balancing selection, gray: recurrent sweeps of positive selection). Standard box plots show (a) the median number of all SNPs, (b) of only SNPs with derived allele frequency < 0.01, and (c) the genetic diversity (π) across all sites. Variants in surrounding regions evolved neutrally (snb  = 0) or under codominant purifying selection with snb  = −0.0001, snb  = −0.001, or snb  = −0.01, respectively. Nonoverlapping notches between box plots indicate significant difference. Note the different y axis scales.
Fig. 3.
Fig. 3.
Simulated site frequency spectrum of variants surrounding the HLA gene under different selection scenarios. Derived allele frequencies along the regions around an HLA gene are derived from simulations with three different selection scenarios on the HLA gene (white: no selection on HLA, blue: balancing selection, gray: recurrent sweeps of positive selection). Variants in neighboring regions evolved (a) neutrally (snb  = 0) or under codominant purifying selection with (b) snb  = −0.0001, (c) snb  = −0.001, or (d) snb  = −0.01, respectively. Note the different y axis scales in the zoomed insets of panels (c) and (d) for better visualization.
Fig. 4.
Fig. 4.
Observed average number of polymorphic sites per gene. The median number of polymorphic sites per gene is averaged over the 113 genes represented in the MHC region (blue arrow, excluding classical HLA genes). Also shown is the distribution of equivalent values for 1,000 Monte Carlo-sampled sets of 113 random genes from the entire exome. Represented are (a) all SNPs and (b) only SNPs with derived allele frequency < 0.01.
Fig. 5.
Fig. 5.
Observed distribution of derived allele frequencies. Derived alleles are defined as the nonreference allele at polymorphic sites in the Exome Sequencing Project data. (a) The median frequency of derived alleles per gene is averaged over the 113 genes represented in the MHC region (blue arrow, excluding classical HLA genes). The yellow density curve shows the distribution of equivalent values for 1,000 Monte Carlo-sampled sets of 113 random genes from the entire exome. (b) Derived allele frequency at polymorphic sites (N = 4,175) in the MHC region is shown in relation to distance to the nearest classical HLA locus. Solid and dashed lines indicate linear fit and 95% confidence intervals, respectively. Color shading indicates local data point density for improved visualization of the extent of overlapping points in the plot, ranging from blue (low density) via green and yellow to red (high density).
Fig. 6.
Fig. 6.
Observed site frequency spectrum (SFS) of deleterious variants. The SFS of (a) probably damaging and (b) loss-of-function variants are shown for the entire exome (white bars) and the MHC region only (blue bars, excluding the classical HLA loci).

Comment in

Similar articles

Cited by

  • Current HLA Investigations on SARS-CoV-2 and Perspectives.
    Douillard V, Castelli EC, Mack SJ, Hollenbach JA, Gourraud PA, Vince N, Limou S; Covid-19|HLA & Immunogenetics Consortium and the SNP-HLA Reference Consortium. Douillard V, et al. Front Genet. 2021 Nov 29;12:774922. doi: 10.3389/fgene.2021.774922. eCollection 2021. Front Genet. 2021. PMID: 34912378 Free PMC article. Review.
  • MHConstructor: a high-throughput, haplotype-informed solution to the MHC assembly challenge.
    Wade KJ, Suseno R, Kizer K, Williams J, Boquett J, Caillier S, Pollock NR, Renschen A, Santaniello A, Oksenberg JR, Norman PJ, Augusto DG, Hollenbach JA. Wade KJ, et al. Genome Biol. 2024 Oct 17;25(1):274. doi: 10.1186/s13059-024-03412-6. Genome Biol. 2024. PMID: 39420419 Free PMC article.
  • The 18th International HLA & Immunogenetics workshop project report: Creating fully representative MHC reference haplotypes.
    Pollock NR, Farias TDJ, Kichula KM, Sauter J, Scholz S, Nii-Trebi NI, Khor SS, Tokunaga K, Voorter CE, Groeneweg M, Augusto DG, Arrieta-Bolaños E, Mayor NP, Edinur HA, ElGhazali G, Issler HC, Petzl-Erler ML, Oksenberg JR, Marin WM, Hollenbach JA, Gendzekhadze K, Cita R, Stelet V, Rajalingam R, Koskela S, Clancy J, Chatzistamatiou T, Houwaart T, Kulski J, Guethlein LA, Parham P, Schmidt AH, Dilthey A, Norman PJ. Pollock NR, et al. HLA. 2024 Jun;103(6):e15568. doi: 10.1111/tan.15568. HLA. 2024. PMID: 38923286 Free PMC article.
  • Mapping the dynamic genetic regulatory architecture of HLA genes at single-cell resolution.
    Kang JB, Shen AZ, Gurajala S, Nathan A, Rumker L, Aguiar VRC, Valencia C, Lagattuta KA, Zhang F, Jonsson AH, Yazar S, Alquicira-Hernandez J, Khalili H, Ananthakrishnan AN, Jagadeesh K, Dey K; Accelerating Medicines Partnership Program: Rheumatoid Arthritis and Systemic Lupus Erythematosus (AMP RA/SLE) Network; Daly MJ, Xavier RJ, Donlin LT, Anolik JH, Powell JE, Rao DA, Brenner MB, Gutierrez-Arcelus M, Luo Y, Sakaue S, Raychaudhuri S. Kang JB, et al. Nat Genet. 2023 Dec;55(12):2255-2268. doi: 10.1038/s41588-023-01586-6. Epub 2023 Nov 30. Nat Genet. 2023. PMID: 38036787 Free PMC article.
  • JTK: targeted diploid genome assembler.
    Masutani B, Suzuki Y, Suzuki Y, Morishita S. Masutani B, et al. Bioinformatics. 2023 Jul 1;39(7):btad398. doi: 10.1093/bioinformatics/btad398. Bioinformatics. 2023. PMID: 37354526 Free PMC article.

References

    1. Adzhubey IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, Kondrashov AS, Sunyaev SR. 2010. A method and server for predicting damaging missense mutations. Nat Methods. 7:248–249. - PMC - PubMed
    1. Albrechtsen A, Moltke I, Nielsen R. 2010. Natural selection and the distribution of identity-by-descent in the human genome. Genetics 186:295–308. - PMC - PubMed
    1. Andrés AM, Hubisz MJ, Indap A, Torgerson DG, Degenhardt JD, Boyko AR, Gutenkunst RN, White TJ, Green ED, Bustamante CD, et al. 2009. Targets of balancing selection in the human genome. Mol Biol Evol. 26:2755–2764. - PMC - PubMed
    1. Assaf ZJ, Petrov DA, Blundell JR. 2015. Obstruction of adaptation in diploids by recessive, strongly deleterious alleles. Proc Natl Acad Sci U S A. 112:E2658–E2666. - PMC - PubMed
    1. Asthana S, Schmidt S, Sunyaev SR. 2005. A limited role for balancing selection. Trends Genet. 21:30–32. - PubMed

Publication types