Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Apr 20;12(1):6519.
doi: 10.1038/s41598-022-10128-9.

Smart pooling: AI-powered COVID-19 informative group testing

Affiliations

Smart pooling: AI-powered COVID-19 informative group testing

María Escobar et al. Sci Rep. .

Abstract

Massive molecular testing for COVID-19 has been pointed out as fundamental to moderate the spread of the pandemic. Pooling methods can enhance testing efficiency, but they are viable only at low incidences of the disease. We propose Smart Pooling, a machine learning method that uses clinical and sociodemographic data from patients to increase the efficiency of informed Dorfman testing for COVID-19 by arranging samples into all-negative pools. To do this, we ran an automated method to train numerous machine learning models on a retrospective dataset from more than 8000 patients tested for SARS-CoV-2 from April to July 2020 in Bogotá, Colombia. We estimated the efficiency gains of using the predictor to support Dorfman testing by simulating the outcome of tests. We also computed the attainable efficiency gains of non-adaptive pooling schemes mathematically. Moreover, we measured the false-negative error rates in detecting the ORF1ab and N genes of the virus in RT-qPCR dilutions. Finally, we presented the efficiency gains of using our proposed pooling scheme on proof-of-concept pooled tests. We believe Smart Pooling will be efficient for optimizing massive testing of SARS-CoV-2.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Smart Pooling makes pooling methods efficient even at high disease prevalences. (a) In standard Dorfman’s testing methods, samples are pooled randomly. As prevalence increases, the efficiency of pooling without a priori information drops rapidly, making the strategy unviable. (b) Smart Pooling tackles this problem by thresholding samples according to the automated predicted risk probability, based on the sociodemographic characteristics. If the risk probability of a sample surpasses the defined threshold, this sample goes directly to individual testing. We arrange samples into pools with similar risk probability. (c) The Smart Pooling pipeline directly integrates machine learning with laboratory procedures.
Figure 2
Figure 2
Smart Pooling decreases the expected number of tests per specimen. (a) Smart Pooling’s expected number of tests per specimen compared to standard testing methods on Patient Dataset. Efficiency improves by reducing the expected number of tests per specimen. Smart Pooling achieves higher efficiencies than Dorfman’s and individual testing for prevalences of the disease of up to 50%, with a fixed pool size of 10. (b) Smart Pooling’s expected number of tests per specimen trained with the coarse metadata from the Test Center Dataset. Efficiency improves by reducing the expected number of tests per specimen. Despite not having detailed patient metadata, Smart Pooling can produce higher efficiencies than Dorfman’s pooling for the simulated prevalence of the disease of up to 25%, with a fixed pool size of 10. (c) Expected number of tests per specimen for the proof-of-concept Smart Pooling Implementation. Each point on the graph represents the performance of both methods for a day in the proof-of-concept stage. The x-axis depicts the incidence of the corresponding day. Efficiency improves by reducing the expected number of tests per specimen. Smart Pooling has a similar efficiency compared to Dorfman’s testing since the daily incidence is lower than 10%. However, for every day of the implementation, Smart Pooling has a higher efficiency than individual testing.
Figure 3
Figure 3
Comparison between Smart Pooling and Random ordering. Left: samples arranged by Smart Pooling. Right: random ordering of samples. Note that Smart Pooling groups positive samples generating long groups of negative samples, reducing the number of times it is necessary to perform the second stage of Dorfman’s testing, increasing the efficiency.
Figure 4
Figure 4
Efficiency of two-stage Dorfman’s pooling as a function of prevalence p and pool size c.
Figure 5
Figure 5
Optimal pooling strategies given a maximum pool size of (a) 10 and (b) 5.

Similar articles

Cited by

References

    1. Zhu N, et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. - DOI - PMC - PubMed
    1. World Health Organization. Coronavirus disease 2019 (covid-19): Situation report, 72. World Health Organization (2020).
    1. Max Roser, E. O.-O., Ritchie, H. & Hasell, J. Coronavirus pandemic (covid-19). Our World in Data. https://ourworldindata.org/coronavirus (2020).
    1. Kucharski AJ, et al. Effectiveness of isolation, testing, contact tracing and physical distancing on reducing transmission of sars-cov-2 in different settings. Lancet. 2020 doi: 10.1016/S1473-3099(20)30457-6. - DOI - PMC - PubMed
    1. Bai Y, et al. Presumed asymptomatic carrier transmission of COVID-19. JAMA. 2020;323:1406–1407. doi: 10.1001/jama.2020.2565. - DOI - PMC - PubMed

Publication types