Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jun 9;19(6):e1010247.
doi: 10.1371/journal.pcbi.1010247. eCollection 2023 Jun.

coiaf: Directly estimating complexity of infection with allele frequencies

Affiliations

coiaf: Directly estimating complexity of infection with allele frequencies

Aris Paschalidis et al. PLoS Comput Biol. .

Abstract

In malaria, individuals are often infected with different parasite strains. The complexity of infection (COI) is defined as the number of genetically distinct parasite strains in an individual. Changes in the mean COI in a population have been shown to be informative of changes in transmission intensity with a number of probabilistic likelihood and Bayesian models now developed to estimate the COI. However, rapid, direct measures based on heterozygosity or FwS do not properly represent the COI. In this work, we present two new methods that use easily calculated measures to directly estimate the COI from allele frequency data. Using a simulation framework, we show that our methods are computationally efficient and comparably accurate to current approaches in the literature. Through a sensitivity analysis, we characterize how the distribution of parasite densities, the assumed sequencing depth, and the number of sampled loci impact the bias and accuracy of our two methods. Using our developed methods, we further estimate the COI globally from Plasmodium falciparum sequencing data and compare the results against the literature. We show significant differences in the estimated COI globally between continents and a weak relationship between malaria prevalence and COI.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. Flowchart of methods.
(A) The relationship between the WSMAF and the PLMAF is shown for an example simulation with a COI of 4. (B) Data have been processed so that loci are deemed variant if they are heterozygous and invariant otherwise. (C) Homozygous data have been filtered out. (D-E) Following the processing of data, Eqs (1) and (2) have been plotted for varying COIs from 1 to 4, respectively.
Fig 2
Fig 2. Estimating the COI on simulated data.
The performance of the Variant Method (A) and Frequency Method (B) is shown for 100 simulations of a COI of 1–20 with 1,000 loci, a read depth of 200, no error added to the simulations, and no sequencing error assumed. Point size indicates density, with the red line representing the line y = x. (C) The mean absolute error for each method is shown. The black bars indicate the 95% confidence interval.
Fig 3
Fig 3. Comparison between THE REAL McCOIL and coiaf.
The COI estimation using (A) the Variant Method and (B) the Frequency Method is compared against the THE REAL McCOIL. (C) The distribution of differences between our estimation and THE REAL McCOIL’s estimation is shown. This difference is computed by subtracting the THE REAL McCOIL’s median estimation of the COI from our estimated value of the COI. The high density observed above 0 for the Frequency Method occurs because the Frequency Method is undefined for a COI of 1. Consequently, for samples that THE REAL McCOIL estimates as having a COI equal to 2, the distribution of our estimates of the COI using the Frequency Method is skewed greater than 2 (B), in contrast to the Variant Method, which exhibits lower skewness (A).
Fig 4
Fig 4. COI across the globe.
The mean (A) and median (B) COI of all samples in each study location within the 24 regions is plotted. The color and size of each point represent the magnitude of the COI. (C) A density plot for each region, where the color of the plot indicates in what subregion the data was sampled. The plots are sorted by the median microscopy prevalence in children aged two to ten as estimated in the Malaria Atlas Project [8, 9, 51] and indicated to the right of each density plot. Map data was obtained from Natural Earth (medium scale data, 1:50m), which is in the public domain.

References

    1. Organization WH. World malaria report 2022. World Health Organization; 2022.
    1. Benelli G, Beier JC. Current vector control challenges in the fight against malaria. Acta Tropica. 2017;174:91–96. doi: 10.1016/j.actatropica.2017.06.028 - DOI - PubMed
    1. Raghavendra K, Barik TK, Reddy BPN, Sharma P, Dash AP. Malaria vector control: from past to future. Parasitology Research. 2011;108(4):757–779. doi: 10.1007/s00436-010-2232-0 - DOI - PubMed
    1. Takken W, Knols BGJ. Malaria vector control: current and future strategies. Trends in Parasitology. 2009;25(3):101–104. doi: 10.1016/j.pt.2008.12.002 - DOI - PubMed
    1. Mutabingwa TK. Artemisinin-based combination therapies (ACTs): best hope for malaria treatment but inaccessible to the needy! Acta Tropica. 2005;95(3):305–315. - PubMed