Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Jan;91(1):36-51.
doi: 10.1111/tan.13180.

Deciphering the fine nucleotide diversity of full HLA class I and class II genes in a well-documented population from sub-Saharan Africa

Affiliations

Deciphering the fine nucleotide diversity of full HLA class I and class II genes in a well-documented population from sub-Saharan Africa

T Goeury et al. HLA. 2018 Jan.

Abstract

With the aim to understand how next-generation sequencing (NGS) improves both our assessment of genetic variation within populations and our knowledge on HLA molecular evolution, we sequenced and analysed 8 HLA loci in a well-documented population from sub-Saharan Africa (Mandenka). The results of full-gene NGS-MiSeq sequencing compared with those obtained by traditional typing techniques or limited sequencing strategies showed that segregating sites located outside exon 2 are crucial to describe not only class I but also class II population diversity. A comprehensive analysis of exons 2, 3, 4 and 5 nucleotide diversity at the 8 HLA loci revealed remarkable differences among these gene regions, notably a greater variation concentrated in the antigen recognition sites of class I exons 3 and some class II exons 2, likely associated with their peptide-presentation function, a lower diversity of HLA-C exon 3, possibly related to its role as a KIR ligand, and a peculiar molecular diversity of HLA-A exon 2, revealing demographic signals. Based on full-length HLA sequences, we also propose that the most frequent DRB1 allele in the studied population, DRB1*13:04, emerged from an allelic conversion involving 3 potential alleles as donors and DRB1*11:02:01 as recipient. Finally, our analysis revealed a high occurrence of the DRB1*13:04-DQA1*05:05:01-DQB1*03:19 haplotype, possibly resulting from a selective sweep due to protection to Onchorcerca volvulus, a prevalent pathogen in West Africa. This study unveils highly relevant information on the molecular evolution of HLA genes in relation to their immune function, calling for similar analyses in other populations living in contrasting environments.

Keywords: Onchocerciasis; HLA nucleotide diversity; West Africa; allelic conversion; balancing selection; full-length HLA genes; population genetics; selective sweep.

PubMed Disclaimer

Conflict of interest statement

The authors have declared no conflicting interests.

Figures

Figure 1
Figure 1
Proportions of genotype matches (or « matching scores ») obtained between PCR‐SSO (“SSO” in the Figure), NGS‐454 (“454”) and NGS‐MiSeq (“MiSeq”) at 3 to 6 HLA loci (DQA1 was not typed with PCR‐SSO and class I genes were not typed with NGS‐454). For the comparisons involving NGS‐454, as the sequences obtained with this technique were limited to exon 2 and may thus correspond to different alleles, we reported a match when the allele found with the other technique was compatible with the NGS‐454 sequence. Only perfect matches (i.e. for any compared genotype, when both alleles found with the 2 techniques were either compatible or identical) were counted. The comparisons were replicated 1′000 times each on random samples of 66 individuals (corresponding to the lowest sample size of the observed data) drawn without replacement to assess the variability of genotype matches due to sampling size
Figure 2
Figure 2
Nucleotide (top) and inferred amino acid (bottom) diversity per site (±σ) at exons encoding the peptide‐binding region (left, with a distinction between antigen‐recognition sites (ARS) and non‐antigen‐recognition sites (non‐ARS) sites); the domains interacting with CD4+ and CD8+ T‐cell receptors (middle); and the trans‐membrane region (right) of the HLA‐A, ‐B, ‐C, ‐DRB1, DQA1, DQB1 and DPB1 molecules in the Mandenka population. Brackets remind the chains forming the HLA‐DQ (DQA1 and DQB1) and HLA‐DP (DPA1 and DPB1) dimers
Figure 3
Figure 3
Principal component analysis (axes 1 and 2, explaining respectively 60% and 20% of the total variance) based on Tajima's D, nucleotide diversity π, frequency of segregating sites S.freq, number of non‐synonymous dN and synonymous dS nucleotides at exons 2, 3 and 4 of loci HLA‐A, ‐B, ‐C, ‐DRB1, ‐DQA1, ‐DQB1, ‐DPA1, ‐DPB1 and exons 5 of loci HLA‐A, ‐B, ‐C. Symbols D, D_1, D_2, D_3 represent Tajima's D estimated on the whole gene region and at the first, second and third nucleotide of each codon, respectively. Similarly, π, π_1, π_2 and π_3 as well as S.freq, S.freq_1, S.freq_2 and S.freq_3 represent the nucleotide diversity π and the frequency of segregating sites S estimated on the whole gene region and at nucleotide positions 1, 2 and 3 of each codon, respectively. Grey boxes correspond to ARS codons, white boxes to non‐ARS codons. The inset graph at the bottom left represents the correlations between the projections of the variables (for each pair of variables, the correlation is measured by the cosine of the angle of the 2 variable vectors) on the plan of the PCA
Figure 4
Figure 4
Putative mechanism of the allelic conversion mentioned in this study, which suggests an unidirectional transfer of genetic material including the « AGCGCC » pattern from a donor allele (potentially DRB1*04:05:01, DRB1*08:06 or DRB1*13:03:01 in the Mandenka) to the recipient allele DRB1*11:02:01, leading to the creation of the DRB1*13:04 allele

References

    1. Shiina T, Hosomichi K, Inoko H, Kulski JK. The HLA genomic loci map: expression, interaction, diversity and disease. J Hum Genet. 2009;54(1):15‐39. https://doi.org/10.1038/jhg.2008.5. - DOI - PubMed
    1. Bentley G, Higuchi R, Hoglund B, et al. High‐resolution, high‐throughput HLA genotyping by next‐generation sequencing. Tissue Antigens. 2009;74(5):393‐403. https://doi.org/10.1111/j.1399-0039.2009.01345.x. - DOI - PMC - PubMed
    1. Shiina T, Suzuki S, Ozaki Y, et al. Super high resolution for single molecule‐sequence‐based typing of classical HLA loci at the 8‐digit level using next generation sequencers: Super high‐resolution DNA typing of HLA loci. Tissue Antigens. 2012;80(4):305‐316. https://doi.org/10.1111/j.1399-0039.2012.01941.x. - DOI - PubMed
    1. Gabriel C, Fürst D, Faé I, et al. HLA typing by next‐generation sequencing ‐ getting closer to reality: HLA typing by NGS. Tissue Antigens. 2014;83(2):65‐75. https://doi.org/10.1111/tan.12298. - DOI - PubMed
    1. Hosomichi K, Shiina T, Tajima A, Inoue I. The impact of next‐generation sequencing technologies on HLA research. J Hum Genet. 2015;60(11):665‐673. https://doi.org/10.1038/jhg.2015.102. - DOI - PMC - PubMed

Publication types