Enhancing SARS-CoV-2 Lineage Surveillance through the Integration of a Simple and Direct qPCR-Based Protocol Adaptation with Established Machine Learning Algorithms
- PMID: 39495866
- PMCID: PMC11579975
- DOI: 10.1021/acs.analchem.4c04492
Enhancing SARS-CoV-2 Lineage Surveillance through the Integration of a Simple and Direct qPCR-Based Protocol Adaptation with Established Machine Learning Algorithms
Abstract
Emerging and evolving Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) lineages, adapted to changing epidemiological conditions, present unprecedented challenges to global public health systems. Here, we introduce an adapted analytical approach that complements genomic sequencing, applying a cost-effective quantitative polymerase chain reaction (qPCR)-based assay. Viral RNA samples from SARS-CoV-2 positive cases detected by diagnostic laboratories or public health network units in Ceará, Brazil, were tracked for genomic surveillance and analyzed by using paired-end sequencing combined with integrative genomic analysis. Validation of a key structural variation was conducted with gel electrophoresis for the presence of a specific open reading frame 7a(ORF7a) gene deletion within the "BE.9" lineages tracked. The analytical innovation of our method is the optimization of a simple intercalating dye-based qPCR assay through repositioning primers from the ARTIC v4.1 amplicon panel to detect large molecular patterns. This assay distinguishes between "BE.9" and "non-BE.9" lineages, particularly BQ.1, without the need for expensive probes or sequencing. The protocol was validated against lineage predictions from next-generation sequencing (NGS) using 525 paired samples, achieving 93.3% sensitivity, 95.1% specificity, and 92.4% agreement, as measured by Cohen's Kappa coefficient. Machine learning (ML) models were trained using the melting curves from intercalating dye-based qPCR of 1724 samples, enabling highly accurate lineage assignment. Among them, the support vector machine (SVM) model had the best performance and after fine-tuning showed ∼96.52% (333/345) accuracy in comparison to the test data set. Our integrated approach provides an adapted analytical method that is both cost-effective and scalable, suitable for rapid assessment of emerging variants, especially in resource-limited settings. In this work, the protocol is applied to improve the monitoring of SARS-CoV-2 sublineages but can be extended to track any key molecular signature, including large insertions and deletions (indels) commonly observed in pathogenic agent subtypes. By offering a complement to traditional sequencing methods and utilizing easily trainable machine learning algorithms, our methodology contributes to enhanced molecular surveillance strategies and supports global efforts in pandemic control.
Conflict of interest statement
The authors declare no competing financial interest.
Figures




References
-
- Abulsoud A. I.; El-Husseiny H. M.; El-Husseiny A. A.; El-Mahdy H. A.; Ismail A.; Elkhawaga S. Y.; Khidr E. G.; Fathi D.; Mady E. A.; Najda A.; Algahtani M.; Theyab A.; Alsharif K. F.; Albrakati A.; Bayram R.; Abdel-Daim M. M.; Doghish A. S. Mutations in SARS-CoV-2: Insights on Structure, Variants, Vaccines, and Biomedical Interventions. Biomed. Pharmacother. 2023, 157, 11397710.1016/j.biopha.2022.113977. - DOI - PMC - PubMed
-
- Tomaszewski T.; DeVries R. S.; Dong M.; Bhatia G.; Norsworthy M. D.; Zheng X.; Caetano-Anollés G. New Pathways of Mutational Change in SARS-CoV-2 Proteomes Involve Regions of Intrinsic Disorder Important for Virus Replication and Release. Evol. Bioinf. 2020, 16, 117693432096514910.1177/1176934320965149. - DOI - PMC - PubMed
-
- Carabelli A. M.; Peacock T. P.; Thorne L. G.; Harvey W. T.; Hughes J.; de Silva T. I.; Peacock S. J.; Barclay W. S.; de Silva T. I.; Towers G. J.; Robertson D. L. SARS-CoV-2 Variant Biology: Immune Escape, Transmission and Fitness. Nat. Rev. Microbiol. 2023, 21, 162–177. 10.1038/s41579-022-00841-7. - DOI - PMC - PubMed
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Medical
Miscellaneous