Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 10;17(1):697-710.
doi: 10.1021/acsnano.2c10159. Epub 2022 Dec 21.

Virus Detection and Identification in Minutes Using Single-Particle Imaging and Deep Learning

Affiliations

Virus Detection and Identification in Minutes Using Single-Particle Imaging and Deep Learning

Nicolas Shiaelis et al. ACS Nano. .

Abstract

The increasing frequency and magnitude of viral outbreaks in recent decades, epitomized by the COVID-19 pandemic, has resulted in an urgent need for rapid and sensitive diagnostic methods. Here, we present a methodology for virus detection and identification that uses a convolutional neural network to distinguish between microscopy images of fluorescently labeled intact particles of different viruses. Our assay achieves labeling, imaging, and virus identification in less than 5 min and does not require any lysis, purification, or amplification steps. The trained neural network was able to differentiate SARS-CoV-2 from negative clinical samples, as well as from other common respiratory pathogens such as influenza and seasonal human coronaviruses. We were also able to differentiate closely related strains of influenza, as well as SARS-CoV-2 variants. Additional and novel pathogens can easily be incorporated into the test through software updates, offering the potential to rapidly utilize the technology in future infectious disease outbreaks or pandemics. Single-particle imaging combined with deep learning therefore offers a promising alternative to traditional viral diagnostic and genomic sequencing methods and has the potential for significant impact.

Keywords: SARS-CoV-2; fluorescence microscopy; influenza; machine learning; viral diagnostics.

PubMed Disclaimer

Conflict of interest statement

The authors declare the following competing financial interest(s): The work was carried out using a wide-field microscope from Oxford Nanoimaging, a company in which A.N.K. is a co-founder and shareholder, and is being commercialized by Pictura Bio, a company in which N.C.R. and N.S. are co-founders. Patent applications relating to the work have been submitted by N.C.R., A.N.K., and N.S. (PCT/GB2019/053073 and PCT/GB2021/050990).

Figures

Figure 1
Figure 1
A fluorescent labeling and imaging strategy to detect viruses. (A) Overview: (i) Viruses were labeled and imaged. Individual signals were isolated, and a convolutional neural network (CNN) was trained to exploit differences in the features of different viruses to identify them. (ii) Signals from unknown samples can then be fed into the trained CNN to allow (iii) virus classification. (B) Representative fields of view (FOVs) of infectious bronchitis virus (CoV (IBV)). 1 ×104 PFU/mL virus was labeled with 0.23 M SrCl2, 1 nM Cy3 (green) DNA, and 1 nM Atto647N (red) DNA before being imaged. Green DNA was observed in the green channel (top panels) and red DNA in the red channel (middle panels); merged red and green localizations are shown in the lower panels. Scale bar, 10 μm. A negative control where DNA was replaced with water is included. (C, D) Zoomed-in images from (B); white boxes represent examples of colocalized particles. Scale bar, 5 μm. (E) Segmentation process: (i) Cropped FOV from the red channel. (ii) Intensity filtering applied to (i) to produce a binary image. (iii) Area filtering applied to (ii) to include only 10–100 pixel objects. (iv) Location image associated with (i). (v) Colocalized signals in the location image. (vi) Bounding boxes (BBXs) found from (iii) drawn onto (v). Non-colocalized objects (cyan) are rejected. (vii) Colocalized objects (red) are drawn over (i). Scale bar, 10 μm. (F) Plot of mean number of BBXs per FOV for labeled CoV (IBV) and the negative controls. Error bars represent the standard deviation of 81 FOVs from a single slide. Statistical significance was determined by one-way ANOVA, *P = 6.01 × 10–22.
Figure 2
Figure 2
Design of a convolutional neural network to classify imaged viruses. (A) Representative FOVs of fluorescently labeled coronavirus (CoV (IBV)), two strains of H3N2 influenza (A/Udorn/72 (Udorn) and A/Aichi/68 (X31)), an H1N1 influenza strain (A/PR8/8/34 (PR8), and a negative control where virus was substituted with allantoic fluid. The samples were immobilized and labeled with 0.23 M SrCl2, 1 nM Cy3 (green) DNA, and 1 nM Atto647N (red) DNA before being imaged. Merged red and green localizations are shown; examples of colocalizations are highlighted with white boxes. Scale bar, 10 μm. (B–D) Normalized frequency plots of the maximum pixel intensity, area, and semimajor-to-semiminor axis ratio within the BBXs of the four different viruses. Values taken from 81 FOVs from a single slide for each virus. Statistical significance was determined by one-way ANOVA, P values depicted above graphs. (E) Illustration of the 15-layer shallow convolutional neural network. Following the input layer (inputs comprising BBXs from the segmentation process), the network consists of three convolutions (stages 1–3). Stages 1 and 2 each contain a ReLU layer to introduce non-linearity, a batch normalization layer (not shown), and a max pooling layer, while stage 3 lacks a max-pooling layer. The classification stage has a fully connected layer and a softmax layer to convert the output of the previous layer to a normalized probability distribution, allowing the initial input to be classified.
Figure 3
Figure 3
Network validation results for laboratory-grown virus strains. (A) Network validation results shown as a confusion matrix: rows, predicted class (output class); columns, true class (target class); right column, positive and negative predictive values (percentages of BBXs that are correctly and incorrectly predicted); bottom row, sensitivity and specificity. (B) Confusion matrix of CoV (IBV) positive and negative samples. (C, D) Confusion matrices of CoV (IBV) vs influenza Udorn or PR8. (E) Confusion matrix of influenza PR8 vs influenza WSN. (F) Confusion matrix of CoV (IBV) vs a pooled dataset of the virus-negative control and three influenza A strains. (G) A trained network is robust over significant time. The network was trained on data from images of the virus IBV and allantoic fluid as a negative control. Each data point (orange for sensitivity; green for specificity) corresponds to the classification result for signals detected at different dates over a period of 135 days. Error bars represent standard deviation. (H) Defining the limit of detection for accurate machine learning classification. Increasing concentrations of IBV were labeled and imaged, the resulting images were fed into the trained network. The number of normalized positive particles (positive particles/all particles) increased linearly with increasing virus concentration. Error bars represent standard deviation. The limit of detection (LOD) was defined as 6 × 104 PFU/mL, with 99.85% certainty.
Figure 4
Figure 4
A deep learning network can differentiate viruses in clinical samples. (A) Workflow for training and validation of clinical samples. Samples were collected from 33 patients, labeled, and imaged on a microscope over three different days. The images were processed to isolate the individual signals into BBXs. 70% of the BBXs were used to train a convolutional neural network (CNN), resulting in a trained model. The remaining 30% of the BBXs were used to validate the trained model, providing the result in a confusion matrix. (B) Confusion matrix showing that a trained network could differentiate positive (Alpha variant) SARS-CoV-2 and negative clinical samples. (C) Confusion matrix showing that a trained network could differentiate influenza A (Flu A) positive clinical samples from negative samples. (D) Confusion matrix showing that a trained network could differentiate SARS-CoV-2 samples (original Wuhan variant) from seasonal human coronavirus (hCoV) samples.
Figure 5
Figure 5
Independent testing of the trained network with clinical samples. (A) Schematic of workflow of independent testing. Previously unseen samples are imaged, the images are processed into BBXs which are fed through a trained network. When necessary, a chi-squared statistical test is performed to test the null hypothesis that the sample is negative. If the p-value is smaller than a pre-set confidence threshold, the null hypothesis is rejected and the sample is classified as positive. If the p-value is greater than the threshold, the sample is classified as negative. (B) Summary of independent testing results using multiple trained models. 51 patient samples that were not used for network training or validation were run through different trained versions of the network, detailed on the bottom of the plot. Some samples were tested in multiple versions of the network, for further details see Sup.Table 2. Chi-squared tests were carried out to classify the samples (see Sup.Figure 8 and Table 2; samples with a p-value smaller than a preset confidence threshold were classified as positive) and the results compared to RT-PCR. 50 out of 51 samples were classified correctly (incorrect classification shown in black), giving an overall sample accuracy of 98.03%. Results were obtained using two different microscopes (see Experimental Methods). (C) Summary of independent testing results using a single trained network. 104 patient samples were analyzed as in B), but tested in a single trained network (SARS-CoV-2 vs negative). (D) Variant classification of clinical samples identified as positive in (C). The BBXs from images of the clinical samples classified as positive by the first network were passed through a second trained model (Wuhan + Alpha SARS-CoV-2 vs Delta SARS-CoV-2).

References

    1. Udugama B.; Kadhiresan P.; Kozlowski H. N.; Malekjahani A.; Osborne M.; Li V. Y. C.; Chen H.; Mubareka S.; Gubbay J. B.; Chan W. C. W. Diagnosing COVID-19: The Disease and Tools for Detection. ACS Nano 2020, 14 (4), 3822–3835. 10.1021/acsnano.0c02624. - DOI - PubMed
    1. Huang W. E.; Lim B.; Hsu C. C.; Xiong D.; Wu W.; Yu Y.; Jia H.; Wang Y.; Zeng Y.; Ji M.; Chang H.; Zhang X.; Wang H.; Cui Z. RT-LAMP for rapid diagnosis of coronavirus SARS-CoV-2. Microb Biotechnol 2020, 13 (4), 950–961. 10.1111/1751-7915.13586. - DOI - PMC - PubMed
    1. Lu R.; Wu X.; Wan Z.; Li Y.; Zuo L.; Qin J.; Jin X.; Zhang C. Development of a Novel Reverse Transcription Loop-Mediated Isothermal Amplification Method for Rapid Detection of SARS-CoV-2. Virol Sin 2020, 35 (4), 344–347. 10.1007/s12250-020-00218-1. - DOI - PMC - PubMed
    1. Lamb L. E.; Bartolone S. N.; Ward E.; Chancellor M. B. Rapid detection of novel coronavirus/Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) by reverse transcription-loop-mediated isothermal amplification. PLoS One 2020, 15 (6), e0234682.10.1371/journal.pone.0234682. - DOI - PMC - PubMed
    1. Zhang Y.; Odiwuor N.; Xiong J.; Sun L.; Nyaruaba R. O.; Wei H.; Tanner N. A. Rapid Molecular Detection of SARS-CoV-2 (COVID-19) Virus RNA Using Colorimetric LAMP. medRxiv 2020, 2020.02.26.20028373(accessed July 07, 2020)10.1101/2020.02.26.20028373. - DOI

Publication types

Supplementary concepts