Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Feb 27;24(5):1526.
doi: 10.3390/s24051526.

Missing Data Statistics Provide Causal Insights into Data Loss in Diabetes Health Monitoring by Wearable Sensors

Affiliations

Missing Data Statistics Provide Causal Insights into Data Loss in Diabetes Health Monitoring by Wearable Sensors

Carlijn I R Braem et al. Sensors (Basel). .

Abstract

Background: Data loss in wearable sensors is an inevitable problem that leads to misrepresentation during diabetes health monitoring. We systematically investigated missing wearable sensors data to get causal insight into the mechanisms leading to missing data.

Methods: Two-week-long data from a continuous glucose monitor and a Fitbit activity tracker recording heart rate (HR) and step count in free-living patients with type 2 diabetes mellitus were used. The gap size distribution was fitted with a Planck distribution to test for missing not at random (MNAR) and a difference between distributions was tested with a Chi-squared test. Significant missing data dispersion over time was tested with the Kruskal-Wallis test and Dunn post hoc analysis.

Results: Data from 77 subjects resulted in 73 cleaned glucose, 70 HR and 68 step count recordings. The glucose gap sizes followed a Planck distribution. HR and step count gap frequency differed significantly (p < 0.001), and the missing data were therefore MNAR. In glucose, more missing data were found in the night (23:00-01:00), and in step count, more at measurement days 6 and 7 (p < 0.001). In both cases, missing data were caused by insufficient frequency of data synchronization.

Conclusions: Our novel approach of investigating missing data statistics revealed the mechanisms for missing data in Fitbit and CGM data.

Keywords: activity trackers; biomedical sensors; continuous glucose monitoring; health monitoring; heart rate; missing data; signal processing; vital signs; wearable sensors.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Overview of the data processing steps.
Figure 2
Figure 2
Overview of the tests to determine the missing data mechanism in continuous data using missing data statistics. MAR = missing at random, MCAR = missing completely at random, MNAR = missing not at random.
Figure 3
Figure 3
Gap size probability distribution (light grey) with the fitted Planck probability mass function (pmf) in dark grey for the three types of data. (a) Glucose gap frequency of glucose data with Planck pmf. (b) Heart rate gap frequency with fitted Planck and Zipf pmf in orange. (c) Step count gap frequency with Planck pmf, which starts at 16 samples due to the step count pre-processing.
Figure 4
Figure 4
Three missing data dispersions over time in the left panels with (in red) the ratio of missing data. In the right panels, the subsequent Dunn post hoc analysis. (a) Missing data dispersion in glucose data in hours of the day and (b) post hoc analysis to test for differences in glucose data between hours of the day. (c) Missing data dispersion in heart rate data over measurement days and (d) post hoc analysis for test differences between heart rate measurement days. (e) Missing data dispersion in step count data over measurement day and (f) post hoc analysis for test differences between step count measurement days.
Figure 5
Figure 5
The missing data dispersion (left panels) and corresponding post hoc analysis (right panels) of missing data for every hour of the day in missing data subgroups. In (a,c,e), the red bars indicate the median ratio of missing data, and the error bars indicate the IQR. (a) Missing dispersion for hour of the day in group <10% missing data in glucose data. (b) Post hoc analysis for hour of the day in group <10% missing data in glucose data. (c) Missing dispersion for hour of the day in group 10–20% missing data in glucose data. (d) Post hoc analysis for hour of the day in group 10–20% missing data in glucose data. (e) Missing dispersion for hour of the day in group >20% missing data in glucose data. (f) Post hoc analysis for hour of the day in group >20% missing data in glucose data.
Figure 6
Figure 6
Missing data distribution (left panels) in glucose data in the subgroups based on years of type 2 diabetes diagnosis for every hour of the day and corresponding post hoc analysis (right panels). In (a,c,e), the red bars indicate the median ratio of missing data, and the error bars are the IQR. (a) Glucose missing data dispersion for every hour of the day in subgroup <10 years since diagnosis of type 2 diabetes (n = 33). (b) Post hoc analysis for hour of the day dispersion in <10 years since diagnosis of type 2 diabetes. (c) Glucose missing dispersion for every hour of the day in subgroup 10–20 years since diagnosis of type 2 diabetes (n = 25). (d) Post hoc analysis for hour of the day dispersion in 10–20 years since diagnosis of type 2 diabetes. (e) Glucose missing dispersion for every hour of the day in subgroup >20 years since diagnosis of type 2 diabetes (n = 16). (f) Post hoc analysis for hour of the day dispersion in >20 years since diagnosis of type 2 diabetes.

Similar articles

Cited by

References

    1. Michie S., Abraham C., Whittington C., McAteer J., Gupta S. Effective Techniques in Healthy Eating and Physical Activity Interventions: A Meta-Regression. Health Psychol. 2009;28:690–701. doi: 10.1037/a0016136. - DOI - PubMed
    1. Phillips S.M., Cadmus-Bertram L., Rosenberg D., Buman M.P., Lynch B.M. Wearable Technology and Physical Activity in Chronic Disease: Opportunities and Challenges. Am. J. Prev. Med. 2018;54:144–150. doi: 10.1016/j.amepre.2017.08.015. - DOI - PMC - PubMed
    1. Peters A., Cohen N., Calhoun P., Ruedy K.J., Beck R.W., Martens T.W., Bao S., Njeru N.M., Beck S.E., Price D.A. Glycaemic Profiles of Diverse Patients with Type 2 Diabetes Using Basal Insulin: MOBILE Study Baseline Data. Diabetes Obes. Metab. 2021;23:631–636. doi: 10.1111/dom.14238. - DOI - PMC - PubMed
    1. Maher C., Ryan J., Ambrosi C., Edney S. Users’ Experiences of Wearable Activity Trackers: A Cross-Sectional Study. BMC Public Health. 2017;17:880. doi: 10.1186/s12889-017-4888-1. - DOI - PMC - PubMed
    1. Matthews C.E., Hagströmer M., Pober D.M., Bowles H.R. Best Practices for Using Physical Activity Monitors in Population-Based Research. MSSE. 2012;44:S68–S76. doi: 10.1249/MSS.0b013e3182399e5b. - DOI - PMC - PubMed