2021 Apr 5;104(5):1830-1835.
doi: 10.4269/ajtmh.21-0117.

Nonparametric Binary Classification to Distinguish Closely Related versus Unrelated Plasmodium falciparum Parasites

Mateusz M Plucinski et al. Am J Trop Med Hyg.

Abstract

Assessing genetic relatedness of Plasmodium falciparum genotypes is a key component of antimalarial efficacy trials. Previous methods have focused on determining a priori definitions of the level of genetic similarity sufficient to classify two infections as sharing the same strain. However, factors such as mixed-strain infections, allelic suppression, imprecise typing methods, and heterozygosity complicate comparisons of apicomplexan genotypes. Here, we introduce a novel method for nonparametric statistical testing of relatedness for P. falciparum parasites. First, the background distribution of genetic distance between unrelated strains is computed. Second, a threshold genetic distance is computed from this empiric distribution of distances to demarcate genetic distances that are unlikely to have arisen by chance. Third, the genetic distance between paired samples is computed, and paired samples with genetic distances below the threshold are classified as related. The method is designed to work with any arbitrary genetic distance definition. We validated this procedure using two independent approaches to calculating genetic distance. We assessed the concordance of the novel nonparametric classification with a gold-standard Bayesian approach for 175 pairs of recurrent P. falciparum episodes from previously published malaria efficacy trials with microsatellite data from five studies in Guinea and Angola. The novel nonparametric approach was 98% sensitive and 84-89% specific in correctly identifying related genotypes compared with a gold-standard Bayesian algorithm. The approach provides a unified and systematic method to statistically assess relatedness of P. falciparum parasites using arbitrary genetic distance methodologies.
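
The three steps described in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the `locus_mismatch_distance` metric is a stand-in for "any arbitrary genetic distance definition", each sample is assumed to be a mapping from locus name to the set of alleles observed, and the 5% cutoff is taken from the "empiric bottom 5% threshold" mentioned in the Figure 1 caption.

```python
from itertools import combinations
import math


def locus_mismatch_distance(a, b):
    """Hypothetical distance (an assumption, not from the paper):
    fraction of jointly typed loci at which two samples share no allele."""
    loci = a.keys() & b.keys()
    if not loci:
        raise ValueError("samples share no typed loci")
    mismatches = sum(1 for locus in loci if not (a[locus] & b[locus]))
    return mismatches / len(loci)


def background_distances(unrelated_samples, distance):
    """Step 1: distances between all pairs of presumed-unrelated samples
    (for example, day 0 samples from different patients)."""
    return [distance(x, y) for x, y in combinations(unrelated_samples, 2)]


def empirical_threshold(distances, alpha=0.05):
    """Step 2: cutoff chosen so that at most a fraction `alpha` of the
    background distances fall strictly below it."""
    if not distances:
        raise ValueError("background distribution is empty")
    ordered = sorted(distances)
    k = min(math.floor(alpha * len(ordered)), len(ordered) - 1)
    return ordered[k]


def classify_pairs(pairs, threshold, distance):
    """Step 3: a pair is classified as related when its distance
    falls strictly below the threshold."""
    return [distance(first, second) < threshold for first, second in pairs]
```

In use, one would pass the baseline samples to `background_distances`, feed the result to `empirical_threshold`, and then call `classify_pairs` on the paired day 0 and day-of-failure samples. Because the metric is passed in as a function, any other distance definition can be swapped in without changing the other steps.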

Conflict of interest statement

Disclaimer: The findings and conclusions in this article are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.

Disclosure: M. M. P. and J. L. N. B. conceived the study, analyzed the data, and drafted the report.

Figures

Figure 1. Distribution of pairwise genetic distance for all day 0–day 0 (D0) pairs, with the empiric bottom 5% threshold denoted (A and C), and distribution of distance between paired D0–day of failure (DOF) samples (B and D), using two different, independent definitions of genetic distance.
Figure 2. Distribution of pairwise genetic distance for paired D0–day of failure (DOF) samples, stratified by Bayesian posterior probability of recrudescence, using two different, independent definitions of genetic distance.
