A machine learning-based approach to determine infection status in recipients of BBV152 (Covaxin) whole-virion inactivated SARS-CoV-2 vaccine for serological surveys
- PMID: 35483225
- PMCID: PMC9040372
- DOI: 10.1016/j.compbiomed.2022.105419
Abstract
Data science has been an invaluable part of the COVID-19 pandemic response, with applications ranging from tracking viral evolution to understanding vaccine effectiveness. Asymptomatic breakthrough infections have been a major problem in assessing vaccine effectiveness in populations globally. Serological discrimination of vaccine response from infection has so far been limited to Spike protein vaccines, since whole-virion vaccines generate antibodies against all the viral proteins. Here, we show how a statistical and machine learning (ML) based approach can be used to discriminate between SARS-CoV-2 infection and the immune response to an inactivated whole-virion vaccine (BBV152, Covaxin). For this, we assessed serial data on antibodies against Spike and Nucleocapsid antigens, along with age, sex, number of doses taken, and days since last dose, for 1823 Covaxin recipients. An ensemble ML model, incorporating a consensus clustering approach alongside a support vector machine model, was built on 1063 samples where reliable qualifying data existed, and then applied to the entire dataset. Of 1448 self-reported negative subjects, our ensemble ML model classified 724 as infected. For method validation, we determined the relative ability of a random subset of samples to neutralize the Delta versus the wild-type strain using a surrogate neutralization assay. We worked on the premise that antibodies generated by a whole-virion vaccine would neutralize the wild-type strain more efficiently than the Delta strain. In 100 of 156 samples where the ML prediction differed from the self-reported uninfected status, neutralization against the Delta strain was more effective, indicating infection. We found that 71.8% of subjects were predicted to be infected during the surge, which is concordant with the percentage of sequences classified as Delta (75.6%-80.2%) over the same period. Our approach will help in real-world vaccine effectiveness assessments where whole-virion vaccines are commonly used.
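The counts quoted in the abstract can be turned into proportions with a few lines of Python. The sketch below is only an illustration and not the authors' code: the constant names, the `percent` helper, and the `delta_favoured` rule are hypothetical, and the rule simply restates the validation premise (a sample that neutralizes Delta better than wild type is read as consistent with infection) without reproducing the actual assay analysis.

```python
# Counts taken directly from the abstract; the names are illustrative.
TOTAL_RECIPIENTS = 1823        # Covaxin recipients with serial antibody data
TRAINING_SAMPLES = 1063        # samples with reliable qualifying data
SELF_REPORTED_NEGATIVE = 1448  # subjects reporting no prior infection
PREDICTED_INFECTED = 724       # of those, classified as infected by the model
DISCORDANT_TESTED = 156        # discordant samples sent to the neutralization assay
DELTA_FAVOURED = 100           # of those, neutralizing Delta more effectively


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)


def delta_favoured(wild_type_inhibition: float, delta_inhibition: float) -> bool:
    """Hypothetical rule restating the validation premise.

    A vaccine-only response is expected to neutralize wild type at least as
    well as Delta, so stronger Delta neutralization is read as a sign of
    infection. The inputs are assumed to be inhibition values on one scale.
    """
    return delta_inhibition > wild_type_inhibition


summary = {
    "training_fraction_pct": percent(TRAINING_SAMPLES, TOTAL_RECIPIENTS),
    "self_reported_negative_pct": percent(SELF_REPORTED_NEGATIVE, TOTAL_RECIPIENTS),
    "predicted_infected_pct": percent(PREDICTED_INFECTED, SELF_REPORTED_NEGATIVE),
    "delta_favoured_pct": percent(DELTA_FAVOURED, DISCORDANT_TESTED),
}

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value}%")
```

Read this way, the model flagged half (50.0%) of the self-reported negative subjects as infected, and the neutralization assay supported the model's call in 64.1% of the discordant samples tested.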
Keywords: BBV152; COVID-19; Covaxin; Ensemble methods; Infection; Machine learning; SARS-CoV-2.
Copyright © 2022 The Author(s). Published by Elsevier Ltd. All rights reserved.
Conflict of interest statement
We declare no conflict of interest.