Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Mar 8;15(3):e1008027.
doi: 10.1371/journal.pgen.1008027. eCollection 2019 Mar.

Genetic signatures of gene flow and malaria-driven natural selection in sub-Saharan populations of the "endemic Burkitt Lymphoma belt"

Affiliations

Genetic signatures of gene flow and malaria-driven natural selection in sub-Saharan populations of the "endemic Burkitt Lymphoma belt"

Mateus H Gouveia et al. PLoS Genet. .

Abstract

Populations in sub-Saharan Africa have historically been exposed to intense selection from chronic infection with falciparum malaria. Interestingly, populations with the highest malaria intensity can be identified by the increased occurrence of endemic Burkitt Lymphoma (eBL), a pediatric cancer that affects populations with intense malaria exposure, in the so called "eBL belt" in sub-Saharan Africa. However, the effects of intense malaria exposure and sub-Saharan populations' genetic histories remain poorly explored. To determine if historical migrations and intense malaria exposure have shaped the genetic composition of the eBL belt populations, we genotyped ~4.3 million SNPs in 1,708 individuals from Ghana and Northern Uganda, located on opposite sides of eBL belt and with ≥ 7 months/year of intense malaria exposure and published evidence of high incidence of BL. Among 35 Ghanaian tribes, we showed a predominantly West-Central African ancestry and genomic footprints of gene flow from Gambian and East African populations. In Uganda, the North West population showed a predominantly Nilotic ancestry, and the North Central population was a mixture of Nilotic and Southern Bantu ancestry, while the Southwest Ugandan population showed a predominant Southern Bantu ancestry. Our results support the hypothesis of diverse ancestral origins of the Ugandan, Kenyan and Tanzanian Great Lakes African populations, reflecting a confluence of Nilotic, Cushitic and Bantu migrations in the last 3000 years. Natural selection analyses suggest, for the first time, a strong positive selection signal in the ATP2B4 gene (rs10900588) in Northern Ugandan populations. These findings provide important baseline genomic data to facilitate disease association studies, including of eBL, in eBL belt populations.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Populations studied in relation to major geographical barriers and analyses of population structure based on genotype data.
(A) Map of Africa showing the geographical origin of the Pan-African populations used in the study comprising 22 populations from previous studies and three new populations in Ghana and the Uganda EMBLEM study (S1 Table). Horizontal dotted lines are the geographical extent of the endemic Burkitt lymphoma (eBL) belt (see S1 Fig and S1 Table for detailed information on Pan-African populations in the eBL belt). The map highlights major geographical features, such as the Congo rainforest (light green), major rivers and lakes, and the East and West African Rift valley systems (pink) that may have shaped migratory routes or constituted barriers to gene flow. (B) Principal Component Analysis (PCA) based on analysis of the genome-wide dataset of Uganda and Ghana integrated with 22 other pan-African populations. For better visualization, open symbols are used for the PCA plot and solid symbols are used in the map. The axes of the plot show the first and third principal components (See S10 Fig for other principal components). The PCA was repeated using similar number of individuals for each studied population (S9 Fig). (C) ADMIXTURE plot showing ancestry clusters in 28 populations from Africa (details in S1, S1A and S1B Table). The populations are listed left to right based on their geographical location in Africa from West to East and North to South. The colors represent different ancestral clusters, with K = 6 being the most likely number of clusters on admixture analysis (See S4 Fig). This ADMIXTURE analysis was repeated using similar number of individuals for each studied population (S6 Fig).
Fig 2
Fig 2. Populations movement routes in relation to major geographical barriers based on genetic inferences.
(A) Map of Africa showing the geographical origin of the 22 previously reported Pan-African populations used herein, plus the three new populations from Ghana and the Uganda EMBLEM study (S1 Table). The arrows in the map indicate population gene flow based on significant f3 statistic tests (S2 Table). Colors of the arrows and the pie charts represent the ancestries inferred by ADMIXTURE (Fig 1C). Rather than shortest geographical distance, shapes of the arrows consider the major geographical barriers such as the Congo rainforest (light green), the East and West African Rift valley systems (pink), and the corresponding Rift Valley lakes and rivers (light blue). (B) The zoomed-out map shows the postulated dispersal routes in greater detail, as well as locations of the populations in the East African Rift Valley plateau sampled in previous studies (2 = Barundi, 3 = Banyarwanda, 4 = Baganda in Uganda; 7 = Luhya, 8 = Kalenjin, and 9 = Kikuyu in Kenya; and 10 = Sandawe in Tanzania), and the Uganda EMBLEM study populations [5 = Uganda North West (UNW), 6 = Uganda North Central (UNC)].
Fig 3
Fig 3. Malaria-driven natural selection analysis of rs10900588-G derived allele of gene ATP2B4. (A) Genome-wide population branch statistic (PBS).
The mean PBS values (20 SNP windows) are represented by a Manhattan plot for Uganda North West (UNW). SNP rs10900588 on chromosome 1 showed the strongest signal of selection (See S15 Fig for LD graphs). (B) World-wide frequencies (red intensity) of the rs10900588-G derived allele, with map of the Pan-African plus UNW and UNC populations. (C) Haplotype bifurcation diagrams for the core haplotype at ATP2B4 gene in Uganda North West (UNW), Uganda North Central (UNC), and Shoto/Zulu. ASW—Americans of African Ancestry in SW USA; MXL—Mexican Ancestry from Los Angeles USA; PUR—Puerto Ricans from Puerto Rico; ACB—African Caribbeans in Barbados; CLM—Colombians from Medellin, Colombia; PEL—Peruvians from Lima, Peru; KHV—Kinh in Ho Chi Minh City, Vietnam; JPT—Japanese in Tokyo, Japan; CDX—Chinese Dai in Xishuangbanna, China; CHB—Han Chinese in Beijing, China; CHS—Southern Han Chinese; GIH—Gujarati Indian from Houston, Texas.

References

    1. Burkitt D. A children’s cancer dependent on climatic factors. Nature. 1962;194: 232–234. - PubMed
    1. Burkitt DP. Etiology of Burkitt’s lymphoma--an alternative hypothesis to a vectored virus. J Natl Cancer Inst. 1969;42: 19–28. - PubMed
    1. Jallow M, Teo YY, Small KS, Rockett KA, Deloukas P, Clark TG, et al. Genome-wide and fine-resolution association analysis of malaria in West Africa. Nat Genet. 2009;41: 657–665. 10.1038/ng.388 - DOI - PMC - PubMed
    1. Legason ID, Pfeiffer RM, Udquim K-I, Bergen AW, Gouveia MH, Kirimunda S, et al. Evaluating the Causal Link Between Malaria Infection and Endemic Burkitt Lymphoma in Northern Uganda: A Mendelian Randomization Study. EBioMedicine. 2017;25: 58–65. 10.1016/j.ebiom.2017.09.037 - DOI - PMC - PubMed
    1. Parkin DM, Sitas F, Chirenje M, Stein L, Abratt R, Wabinga H. Part I: Cancer in Indigenous Africans--burden, distribution, and trends. Lancet Oncol. 2008;9: 683–692. 10.1016/S1470-2045(08)70175-X - DOI - PubMed

Publication types

MeSH terms

Substances