Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Sep 1;35(9):2145-2158.
doi: 10.1093/molbev/msy116.

Divergent Allele Advantage at Human MHC Genes: Signatures of Past and Ongoing Selection

Affiliations

Divergent Allele Advantage at Human MHC Genes: Signatures of Past and Ongoing Selection

Federica Pierini et al. Mol Biol Evol. .

Abstract

The highly polymorphic genes of the major histocompatibility complex (MHC) play a key role in adaptive immunity. Divergent allele advantage, a mechanism of balancing selection, is proposed to contribute to their exceptional polymorphism. It assumes that MHC genotypes with more divergent alleles allow for broader antigen-presentation to immune effector cells, by that increasing immunocompetence. However, the direct correlation between pairwise sequence divergence and the corresponding repertoire of bound peptides has not been studied systematically across different MHC genes. Here, we investigated this relationship for five key classical human MHC genes (human leukocyte antigen; HLA-A, -B, -C, -DRB1, and -DQB1), using allele-specific computational binding prediction to 118,097 peptides derived from a broad range of human pathogens. For all five human MHC genes, the genetic distance between two alleles of a heterozygous genotype was positively correlated with the total number of peptides bound by these two alleles. In accordance with the major antigen-presentation pathway of MHC class I molecules, HLA-B and HLA-C alleles showed particularly strong correlations for peptides derived from intracellular pathogens. Intriguingly, this bias coincides with distinct protein compositions between intra- and extracellular pathogens, possibly suggesting adaptation of MHC I molecules to present specifically intracellular peptides. Eventually, we observed significant positive correlations between an allele's average divergence and its population frequency. Overall, our results support the divergent allele advantage as a meaningful quantitative mechanism through which pathogen-mediated selection leads to the evolution of MHC diversity.

Keywords: HLA; balancing selection; heterozygote advantage; human evolution; pathogen-mediated selection.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.
Fig. 1.
In silico evidence for divergent allele advantage across five classical human MHC genes. Correlation between pairwise genetic distances reported as Grantham distance (x axes) and number of bound peptides (y axes) counted for all possible pairs of common HLA alleles. Each dot represents an allele pair. Binding prediction analyses performed on the complete data set of pathogen proteins (n = 232). Linear model (red line) and smoothed lowess curve (dashed blue line), describing the association between the combined number of bound peptides and pairwise Grantham sequence divergence. Note the different axis scales.
<sc>Fig</sc>. 2.
Fig. 2.
Different origins of pathogenic peptides. Correlation between pairwise genetic distances reported as Grantham distance (x axes) and number of bound peptides (y axes) calculated for all possible pairs of common HLA alleles. Each dot represents an allele pair. Binding prediction analyses performed considering proteins within three groups of pathogens: extracellular (orange, n = 57 proteins), intracellular (green, n = 100), and intra-extracellular (purple, n = 75).
<sc>Fig</sc>. 3.
Fig. 3.
Multidimensional scaling plot of amino acid composition in intracellular and extracellular pathogen proteins. Multidimensional scaling (MDS) based on amino acid frequencies indicates similarity in amino acid composition among individual proteins (dots). Intracellular proteins (n = 100) are reported in green while extracellular proteins (n = 57) in orange. MDS enables a standardized unit-less representation of variation among data points in 2D space (along perpendicular axes MDS1 and MDS2): location of proteins within the plot is indicative of potential bias toward specific amino acids (blue characters in one letter code), proteins with more similar amino acid composition are displayed closer to each other. The dashed circles indicate 95% confidence intervals for each group. Stress for 2D representation: 0.21.
<sc>Fig</sc>. 4.
Fig. 4.
Population frequency of divergent HLA alleles. Correlation between the average Grantham pairwise divergence to the most common alleles and the allele frequency in the USA NMDP European Caucasian (N = 1,242,890), German (N= 39,689), and Polish (N= 20,653) populations, for four classical HLA loci with available allele frequency data in AlleleFrequencies.net. Significant associations that persisted after Bonferroni correction across populations are reported with a solid line, while dashed lines indicate associations that are only nominally significant (P < 0.05 before Bonferroni correction; for exact values see supplementary table S9, Supplementary Material online).

Similar articles

Cited by

References

    1. Abi-Rached L, Jobin MJ, Kulkarni S, McWhinnie A, Dalva K, Gragert L, Babrzadeh F, Gharizadeh B, Luo M, Plummer FA.. 2011. The shaping of modern human immune systems by multiregional admixture with archaic humans. Science 3346052:89–94. - PMC - PubMed
    1. Aken BL, Ayling S, Barrell D, Clarke L, Curwen V, Fairley S, Fernandez Banet J, Billis K, García Girón C, Hourlier T, et al. 2016. The Ensembl gene annotation system. Database 2016:baw093.. - PMC - PubMed
    1. Bause E. 1983. Structural requirements of N-glycosylation of proteins. Studies with proline peptides as conformational probes. Biochem J. 2092:331–336. - PMC - PubMed
    1. Beck S, Trowsdale J.. 2000. The human major histocompatibility complex: lessons from the DNA sequence. Annu Rev Genomics Hum Genet. 11:117–137. - PubMed
    1. Berger SA. 2005. GIDEON: a comprehensive Web-based resource for geographic medicine. Int J Health Geogr. 41:10.. - PMC - PubMed