Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Mar 4;17(3):e1008726.
doi: 10.1371/journal.pcbi.1008726. eCollection 2021 Mar.

Group testing as a strategy for COVID-19 epidemiological monitoring and community surveillance

Affiliations

Group testing as a strategy for COVID-19 epidemiological monitoring and community surveillance

Vincent Brault et al. PLoS Comput Biol. .

Abstract

We propose an analysis and applications of sample pooling to the epidemiologic monitoring of COVID-19. We first introduce a model of the RT-qPCR process used to test for the presence of virus in a sample and construct a statistical model for the viral load in a typical infected individual inspired by large-scale clinical datasets. We present an application of group testing for the prevention of epidemic outbreak in closed connected communities. We then propose a method for the measure of the prevalence in a population taking into account the increased number of false negatives associated with the group testing method.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1
(A) Sketch of an RT-qPCR fluorescence intensity signal for a positive patient without pooling (solid red line) a single positive patient in a pool of 64 patients (dashed red curve) and for a negative sample representing the response of an artefact (dotted magenta curve); as pooling dilutes the initial concentration, the pooled response (dashed red curve) is expected to be close to the translation xx + 6 from that of a single patient (solid red line). (B) Sketch of the distribution of threshold values for RT-qPCR individual tests (solid blue line) or in pools 64 (dashed red curve); part of the distribution crosses the limit of detection of the test (figured as the grey area) at the detection threshold dcens.
Fig 2
Fig 2
(A) Representation of the density for the classical mixing Gaussian model (dashed lines) and the partially censored model (solid lines) each composed as a sum of 3 components for the Gaussian model (orange/green/red dashed lines) and the partially censored model (orange/green/red solid lines); (purple vertical line) location of the threshold datt ≈ 35.6. Data based on the histogram presented in [56]. (B) Focus on the false negative region, with the estimated false negative probability in the partially censored model (solid line) due to the defect of detection above the threshold datt (red color filled area). (C) Mixing Gaussian model on the ImpactSaliva dataset presented in [54]. (D-E) Mixing Gaussian model on the ImpactSaliva dataset presented in [55] for (D) asymptomatic and (E) symptomatic individuals at the moment of the test. Raw data available in S1 Data.
Fig 3
Fig 3
(A) Model 1 for the evolution of the viral load post-infection. (B) Modelled distribution in the infection age at the moment of the test for (red) symptomatic individuals as a Gamma function; (blue-cyan) fully asymptomatic (i.e. throughout the infection) individuals either as (blue) a constant if the new infections rate is a constant with time or (cyan) as an exponential if the rate of new infections decays exponentially with time (characteristic decay time τ = 10 days). (C) In the Model 1 context, the distribution of the viral load in asymptomatic individuals is relatively uniform. (D) Model 2 for the evolution of the viral load post-infection distinguishing between symptomatic and asymptomatic (combining [58] and [59]). Parameters estimate are provided in Table H in S1 Text. (E) In the model 2 context, the distribution of the viral load in asymptomatic individuals shows 2 peaks at high and low viral loads. (F) The distribution of the viral load in symptomatic individuals is less bimodal than the observed asymptomatic distribution.
Fig 4
Fig 4
(A-C) Relative increase in the false negative risk (1-Φ(dcens(N)))/(1-Φ(dcens(1))) in pools of size N including a single infected individual whose viral distribution is estimated using the naive (solid line), partially censored (circle) or fully censored (crossed line) fitting method of the following datasets from (A) Watkins et al. [54], (B) Jones et al. [56] and (B) Lennon et al. [55]. In (A), we superpose the clinical estimation of the risk of false negative provided in [54] (red crosses). Here, in contrast to [26], we do not change the threshold level of positivity compared to the individual test.
Fig 5
Fig 5
Detection probability within a community of 120 as a function of the total number of sampled individuals M = k × N, where k is the total number of tests used and N the number of samples pooled together in a test (A) Case of a single patient 0 with low viral load; k = 5 (red dotted line); k = 4 (orange line with arrow), k = 3 (purple line with circles); k = 2 (dashed green line); k = 1 (solid blue line) for several values in the parameters describing the viral load of the patient 0 at the onset of contagiosity, expressed in terms of a normal distribution in Ct (the number of RT-qPCR amplification cycles) with a standard deviation σ0 and a mean μ0 and a threshold at a value denoted dcens satisfying: μ0 = dcens − 1 (top row), modelling a patient 0 with a very low viral concentration, μ0 = dcens − 3 (middle row), μ0 = dcens − 5 (bottom row); σ0 = 2 (left column); σ0 = 6 (right column). (B) Case of a single patient 0 with a viral load distributed datasets(left) for the three fitting methods used to describe the asymptomatic dataset corresponding to Lennon et al. [55], for k = 1 (blue) and k = 5 (red)and (right) comparing the datasets of Lennon et al. [55] and Jones et al. [56] for the naive fitting method (upper curve k = 5, lower curve k = 1).
Fig 6
Fig 6
(A,B) Total number of tests (red) and total number of sampled individuals (blue) in order to estimate a prevalence of p = 1% with a ±0.2% precision with 95% confidence interval as a function of the pool size N for the perfect case (dashed lines) with no false negative versus the case with false negatives (solid lines) estimated according to the Lennon et al. asymptomatic dataset [55]. In (A) N ranges from 0 to 25; in (B) N ranges from 0 to 128. The optimal pool size Nopt(perf) is beyond the N-axis limit.

References

    1. Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020;581(7809):465–469. 10.1038/s41586-020-2196-x - DOI - PubMed
    1. World Health Organization. Laboratory testing for 2019 novel coronavirus (2019-nCoV) in suspected human cases. WHO—Interim guidance. 2020;2019(January):1–7.
    1. Corman VM, Landt O, Kaiser M, Molenkamp R, Meijer A, Chu DKW, et al. Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR. Eurosurveillance. 2020;25(3):1–8. 10.2807/1560-7917.ES.2020.25.3.2000045 - DOI - PMC - PubMed
    1. Gollier C, Gossner O. Group Testing against Covid-19. Covid Economics. 2020; p. 32–42.
    1. Pouwels KB, Roope LSJ, Barnett A, Hunter DJ, Nolan TM, Clarke PM. Group Testing for SARS-CoV-2: Forward to the Past? PharmacoEconomics—Open. 2020; p. 2–5. - PMC - PubMed