Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 28;7(18):eabc3713.
doi: 10.1126/sciadv.abc3713. Print 2021 Apr.

Population genomic evidence of Plasmodium vivax Southeast Asian origin

Affiliations

Population genomic evidence of Plasmodium vivax Southeast Asian origin

Josquin Daron et al. Sci Adv. .

Abstract

Plasmodium vivax is the most common and widespread human malaria parasite. It was recently proposed that P. vivax originates from sub-Saharan Africa based on the circulation of its closest genetic relatives (P. vivax-like) among African great apes. However, the limited number of genetic markers and samples investigated questions the robustness of this hypothesis. Here, we extensively characterized the genomic variations of 447 human P. vivax strains and 19 ape P. vivax-like strains collected worldwide. Phylogenetic relationships between human and ape Plasmodium strains revealed that P. vivax is a sister clade of P. vivax-like, not included within the radiation of P. vivax-like By investigating various aspects of P. vivax genetic variation, we identified several notable geographical patterns in summary statistics in function of the increasing geographic distance from Southeast Asia, suggesting that P. vivax may have derived from a single area in Asia through serial founder effects.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1. Geographical origin of Plasmodium isolates and patterns of genomic variation.
(A) Country of origin of the 447 P. vivax and 19 P. vivax-like isolates used in this study. Within each country, isolates were collected at different locations. The chimpanzee pictogram represents African P. vivax-like isolates. (B) Local variation of genetic relatedness along the genome of P. vivax isolates visualized by multidimensional scaling (MDS) based on the local PCA approach (39). Each point represents a non-overlapping genomic window of 100 SNPs. On the basis of the variation of the MDS-1 coordinate, each window was classified in two groups: subtelomeric hypervariable region (orange) and core region (purple). (C) Genome scan of the SNPs on chromosome 5 (left y axis) identified for both P. vivax and P. vivax-like. The gray area in the background shows the total number of SNPs identified (left y axis), and the black line represents the number of SNPs shared by the two species. The right y axis represents the MDS-1 coordinates against the middle point of each window. Windows were classified in the subtelomeric hypervariable region (orange) or the core region (purple).
Fig. 2
Fig. 2. P. vivax-like strains are structured in two distinct clades that form a sister monophyletic lineage to the human P. vivax.
(A) ML phylogenetic tree illustrating the relationships between P. vivax (PV) and P. vivax-like (PVL) strains, rooted using P. cynomolgi (PC, M strain). Two distinct P. vivax-like groups were identified: PVL.grp1 (in orange) and PVL.grp2 (in brown). The animal pictograms on each leaf indicate the primate host, gorilla, chimpanzee, or mosquito (unknown primate host), colored according to the country of origin (Gabon, Cameroon, and Ivory Coast). Open and closed circles indicate nodes with >80 and >90% bootstrap support, respectively. (B) PCA displaying the two first eigen vectors (EVs) and the proportion of genetic variance they explain. (C) Genome-wide visualization of recent recombination events between P. vivax-like individuals. The horizontal black rectangles represent the 19 P. vivax-like genome sequences, and the vertical colored lines represent the recombining genomic segments on the recipient individual. Colors indicate the lineage membership of the donor individual. (D) Differences in nucleotide diversity (π) between P. vivax-like lineages and P. vivax strains. (E) Multiple sequentially Markovian coalescent (MSMC) estimates of the effective population size (Ne) in the two P. vivax-like groups (PVL.grp1 in orange and PVL.grp2 in brown). Lines in lighter color represent 50 bootstrap resampling replicates of randomly sampled segregating sites.
Fig. 3
Fig. 3. The structure of P. vivax core region of the genome is mostly due to genetic isolation of natural populations.
(A) PCA using the SNPs from the core region of the genome of the 447 P. vivax isolates showing the top two EVs and the accounted proportion of genetic variance. (B) ML phylogenetic tree built using SNP data from the core region of the genome. The star indicates the root of the tree, leading to the P. vivax-like outgroup. Specific internal nodes are highlighted with open circles and represent >80% bootstrap support. (C) The genetic ancestry proportion for individual strains, depicted as a vertical bar, was estimated using ADMIXTURE for each of the K = 6 inferred ancestral populations. (D) These same ancestral proportions for each population are displayed as pie charts on the world map. (E) Isolation by distance among populations. Pairwise estimates of FST/1 − FST were plotted against the corresponding geographical distances between countries. The Spearman correlation coefficient (R2) and the P value estimated using a Mantel test with 1000 permutations are shown.
Fig. 4
Fig. 4. Relationships and gene flow between P. vivax populations.
(A) TreeMix ML tree of P. vivax populations with four migration edges (arrows) rooted with P. vivax-like (PVL), including bootstrap node support. (B) TreeMix residual matrix from the tree in (A). PNG, Papua New Guinea.
Fig. 5
Fig. 5. The Southeast Asian origin of P. vivax is supported by patterns of nucleotide diversity, LD, and AAF.
(A) Genetic diversity of P. vivax regressed on the great circle distance across the world. The value at each pixel of the map corresponds to the Spearman correlation coefficient (R) between the expected genetic diversity in each population and the geographic distance between this population and the pixel. Black dots represent the sampling sites used in the regression analysis (where n ≥ 5 individuals). The landmark on the map represents the most negative correlation coefficient indicative of the most probable source of the range expansion. (B) Correlation between nucleotide diversity (π) and geographic distance, measured as the distance between the milestone indicated in (A) and each population. (C) Correlation between LD at 100 bp (measured as the normalized R2) and the geographic distance. (D) Histograms of AAF proportions in P. vivax populations with an increasing sample size (N = 5, 7, 10, and 20).
Fig. 6
Fig. 6. Coalescent-based inference of the demographic history in each P. vivax population.
(A) Effective population size (Ne) variations back to the TMRCA inferred using MSMC and (B) inferred TMRCA in each population. The analysis was performed using five individuals from each population, assuming a mutation rate per generation (μ·g) of 1.158 × 10−9 and a generation time (g) of 0.18. The gray area represents the first 1000 generations, at which low resolution of the recent past effective population size is observed, likely due to the presence of sites with high probability of being called inaccurately.

Similar articles

Cited by

References

    1. Battle K. E., Lucas T. C. D., Nguyen M., Howes R. E., Nandi A. K., Twohig K. A., Pfeffer D. A., Cameron E., Rao P. C., Casey D., Gibson H. S., Rozier J. A., Dalrymple U., Keddie S. H., Collins E. L., Harris J. R., Guerra C. A., Thorn M. P., Bisanzio D., Fullman N., Huynh C. K., Kulikoff X., Kutz M. J., Lopez A. D., Mokdad A. H., Naghavi M., Nguyen G., Shackelford K. A., Vos T., Wang H., Lim S. S., Murray C. J. L., Price R. N., Baird J. K., Smith D. L., Bhatt S., Weiss D. J., Hay S. I., Gething P. W., Mapping the global endemicity and clinical burden of Plasmodium vivax, 2000–17: A spatial and temporal modelling study. Lancet 394, 332–343 (2019). - PMC - PubMed
    1. Howes R. E., Battle K. E., Mendis K. N., Smith D. L., Cibulskis R. E., Baird J. K., Hay S. I., Global epidemiology of Plasmodium vivax. Am. J. Trop. Med. Hyg. 95, 15–34 (2016). - PMC - PubMed
    1. Horuk R., Chitnis C. E., Darbonne W. C., Colby T. J., Rybicki A., Hadley T. J., Miller L. H., A receptor for the malarial parasite Plasmodium vivax: The erythrocyte chemokine receptor. Science 261, 1182–1184 (1993). - PubMed
    1. Chaudhuri A., Zbrzezna V., Polyakova J., Pogo A. O., Hesselgesser J., Horuk R., Expression of the Duffy antigen in K562 cells. Evidence that it is the human erythrocyte chemokine receptor. J. Biol. Chem. 269, 7835–7838 (1994). - PubMed
    1. R. N. Price, E. Tjitra, C. A. Guerra, S. Yeung, N. J. White, N. M. Anstey, Vivax Malaria: Neglected and Not Benign (American Society of Tropical Medicine and Hygiene, 2007). - PMC - PubMed

Publication types