Passive detection of COVID-19 with wearable sensors and explainable machine learning algorithms

Matteo Gadaleta¹, Jennifer M Radin¹, Katie Baca-Motes¹, Edward Ramos^{1

2}, Vik Kheterpal², Eric J Topol¹, Steven R Steinhubl¹, Giorgio Quer³

Affiliations

¹ Scripps Research Translational Institute, 3344N Torrey Pines Ct Plaza Level, La Jolla, CA, 92037, USA.
² CareEvolution, 625N Main Street, Ann Arbor, MI, 48104, USA.
³ Scripps Research Translational Institute, 3344N Torrey Pines Ct Plaza Level, La Jolla, CA, 92037, USA. gquer@scripps.edu.

PMID: 34880366
PMCID: PMC8655005
DOI: 10.1038/s41746-021-00533-1

Passive detection of COVID-19 with wearable sensors and explainable machine learning algorithms

Matteo Gadaleta et al. NPJ Digit Med. 2021.

. 2021 Dec 8;4(1):166.

doi: 10.1038/s41746-021-00533-1.

Authors

Matteo Gadaleta¹, Jennifer M Radin¹, Katie Baca-Motes¹, Edward Ramos^{1

2}, Vik Kheterpal², Eric J Topol¹, Steven R Steinhubl¹, Giorgio Quer³

Affiliations

¹ Scripps Research Translational Institute, 3344N Torrey Pines Ct Plaza Level, La Jolla, CA, 92037, USA.
² CareEvolution, 625N Main Street, Ann Arbor, MI, 48104, USA.
³ Scripps Research Translational Institute, 3344N Torrey Pines Ct Plaza Level, La Jolla, CA, 92037, USA. gquer@scripps.edu.

PMID: 34880366
PMCID: PMC8655005
DOI: 10.1038/s41746-021-00533-1

Abstract

Individual smartwatch or fitness band sensor data in the setting of COVID-19 has shown promise to identify symptomatic and pre-symptomatic infection or the need for hospitalization, correlations between peripheral temperature and self-reported fever, and an association between changes in heart-rate-variability and infection. In our study, a total of 38,911 individuals (61% female, 15% over 65) have been enrolled between March 25, 2020 and April 3, 2021, with 1118 reported testing positive and 7032 negative for COVID-19 by nasopharyngeal PCR swab test. We propose an explainable gradient boosting prediction model based on decision trees for the detection of COVID-19 infection that can adapt to the absence of self-reported symptoms and to the available sensor data, and that can explain the importance of each feature and the post-test-behavior for the individuals. We tested it in a cohort of symptomatic individuals who exhibited an AUC of 0.83 [0.81-0.85], or AUC = 0.78 [0.75-0.80] when considering only data before the test date, outperforming state-of-the-art algorithm in these conditions. The analysis of all individuals (including asymptomatic and pre-symptomatic) when self-reported symptoms were excluded provided an AUC of 0.78 [0.76-0.79], or AUC of 0.70 [0.69-0.72] when considering only data before the test date. Extending the use of predictive algorithms for detection of COVID-19 infection based only on passively monitored data from any device, we showed that it is possible to scale up this platform and apply the algorithm in other settings where self-reported symptoms can not be collected.

PubMed Disclaimer

Conflict of interest statement

S.R.S. is employed by PhysIQ. The other authors declare no competing interests.

Figures

**Fig. 1. Output of the prediction models.**
The model’s output statistic is reported for symptomatic cases, excluding (a) and including the data after the test date (b), and for no-symptom-reported cases, excluding (c) and including the data after the test date (d). The boxes represent the IQR, and the horizontal lines are the median values. The number of cases considered for the analysis are reported in the legend.

**Fig. 2. Evaluation results for the discrimination between COVID-19 positive and COVID-19 negative.**
Receiver operating characteristic curves (ROCs) for the discrimination between COVID-19 positive and COVID-19 negative. Performance for symptomatic cases, excluding (a) and including the data after the test date (b), and for no-symptom-reported cases, excluding (c) and including the data after the test date (d), are reported. The model is a gradient boosting prediction model based on decision trees. Median values and 95% confidence intervals (CIs) for sensitivity (SE), specificity (SP), positive predictive value (PPV) and negative predictive value (NPV) are reported, considering the point on the ROC with the highest average value of sensitivity and specificity. Error bars represent 95% CIs. p-values of the one-sided Mann-Whitney U test are reported.

**Fig. 3. Overall feature importance.**
Overall feature importance based on the average prediction changes when the feature value is perturbed. Values are normalized as percentages. Features have been aggregated into macro categories. Results for symptomatic cases, excluding (a) and including data after test date (b), and for no-symptom-reported cases, excluding (c) and including data after test date (d), are reported.

**Fig. 4. Feature importance associated with specific symptoms.**
Only symptoms reported before the test date have been considered. Values are normalized as percentages. The results refer to symptomatic cases only.

**Fig. 5. Percentage of reported symptoms.**
Percentage of reported symptoms for participants who reported at least one symptom from 15 days before to 5 days after the test date. The frequencies of the indicated symptoms are shown for positive and negative cases. The error bars represent 95% percent confidence intervals. The p-values of a two-sided Fisher’s exact test applied to COVID-19 positive (539 individuals) and negative participants (1,520 individuals) are reported. Symptoms with a significant difference (p-value < 0.05) are marked with an asterisk.

**Fig. 6. Weights for the evaluation of the baseline data.**
Exponentially decreasing weights for the evaluation of the baseline data, with weekly patterns. The abscissa represents the temporal distance preceding the analyzed day considered for the baseline evaluation. The first 6 days have been excluded to avoid recent changes to affect the baseline. Additionally, if a symptom has been reported in this time frame, we set the weights to zero from the day of symptom to the next 10 days.

See this image and copyright information in PMC

References

1. NIH. COVID-19 Treatment Guidelines. https://www.covid19treatmentguidelines.nih.gov/whats-new/ (NIH, 2021).
1. Manabe YC, Sharfstein JS, Armstrong K. The need for more and better testing for COVID-19. JAMA. 2020;324:2153–2154. doi: 10.1001/jama.2020.21694. - DOI - PMC - PubMed
1. Menni C, et al. Real-time tracking of self-reported symptoms to predict potential COVID-19. Nat. Med. 2020;26:1037–1040. doi: 10.1038/s41591-020-0916-2. - DOI - PMC - PubMed
1. Oran DP, Topol EJ. Prevalence of asymptomatic SARS-CoV-2 Infection. Ann. Int. Med. 2020;173:362–367. doi: 10.7326/M20-3012. - DOI - PMC - PubMed
1. Quer G, Gouda P, Galarnyk M, Topol EJ, Steinhubl SR. Inter- and intraindividual variability in daily resting heart rate and its associations with age, sex, sleep, BMI, and time of year: Retrospective, longitudinal cohort study of 92,457 adults. PLoS ONE. 2020;15:e0227709. doi: 10.1371/journal.pone.0227709. - DOI - PMC - PubMed

Grants and funding

LinkOut - more resources

Full Text Sources
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Passive detection of COVID-19 with wearable sensors and explainable machine learning algorithms

Affiliations

Passive detection of COVID-19 with wearable sensors and explainable machine learning algorithms

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

References

Grants and funding

LinkOut - more resources

Full Text Sources

Miscellaneous