Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Mar 12;21(3):274.
doi: 10.3390/e21030274.

An Improved Method of Handling Missing Values in the Analysis of Sample Entropy for Continuous Monitoring of Physiological Signals

Affiliations

An Improved Method of Handling Missing Values in the Analysis of Sample Entropy for Continuous Monitoring of Physiological Signals

Xinzheng Dong et al. Entropy (Basel). .

Abstract

Medical devices generate huge amounts of continuous time series data. However, missing values commonly found in these data can prevent us from directly using analytic methods such as sample entropy to reveal the information contained in these data. To minimize the influence of missing points on the calculation of sample entropy, we propose a new method to handle missing values in continuous time series data. We use both experimental and simulated datasets to compare the performance (in percentage error) of our proposed method with three currently used methods: skipping the missing values, linear interpolation, and bootstrapping. Unlike the methods that involve modifying the input data, our method modifies the calculation process. This keeps the data unchanged which is less intrusive to the structure of the data. The results demonstrate that our method has a consistent lower average percentage error than other three commonly used methods in multiple common physiological signals. For missing values in common physiological signal type, different data size and generating mechanism, our method can more accurately extract the information contained in continuously monitored data than traditional methods. So it may serve as an effective tool for handling missing values and may have broad utility in analyzing sample entropy for common physiological signals. This could help develop new tools for disease diagnosis and evaluation of treatment effects.

Keywords: complexity; medical information; missing values; physiological data; sample entropy.

PubMed Disclaimer

Conflict of interest statement

The authors declare no potential conflict of interest with respect to the research, authorship, and/or publication of this article.

Figures

Figure A1
Figure A1
Percentage errors for each method in three types of continuously monitoring physiological signals: blood glucose level (top panel), RR interval (middle panel), air flow (bottom panel). Each panel includes three conditions: r = 0.1 × σ (left), r = 0.15 × σ (middle) and r = 0.2 × σ (right). Values are given as means ± standard deviation. The results for BootSampEn are not shown in the Figure 2 when the percentage error is too large and out of range of the figure.
Figure 1
Figure 1
Two schemes for generating missing values.
Figure 2
Figure 2
Performance of methods for handling missing values in four types of continuously monitoring physiological signals: air flow (left), blood glucose level (middle left-skewed), EEG (middle right-skewed), RR interval (right). Values are given as means ± standard deviation. The results for BootSampEn are not shown in the Figure 2 when the percentage error is too large and out of range of the figure.
Figure 3
Figure 3
Average percentage errors for each method in four types of continuously monitoring physiological signals: air flow (left), blood glucose level (middle left-skewed), RR interval (right). The percentage errors by BootSampEn in the left and middle panels are higher than 120% and are not shown in the figure. Values are given as means ± standard error. NS means p > 0.05, * means p < 0.05, ** means p < 0.01, *** means p < 0.001.
Figure 4
Figure 4
Performance of methods for handling missing values in air flow time series of nine large data sizes. Values are given as mean ± standard deviation. The percentage error for BootSampEn is out of the range and not shown in the figure.
Figure 5
Figure 5
Performance of methods for handling missing values in a dataset with a small size (i.e., less than 2000 data points) for four types of physiological signals (i.e., air flow, blood glucose level, EEG and RR interval. Values are given as mean ± standard deviation. The percentage error for BootSampEn is out of the range and not shown in the figure.
Figure 6
Figure 6
Percentage errors for each method using random sample marking and group-based random marking schemes to generate missing values, respectively. Values are given as mean±standard deviation. The percentage error for BootSampEn is out of the range and not shown in the figure.
Figure 7
Figure 7
Evaluation of the running time of four methods for handling missing values. Each method is run on five values of percentage of missing values (i.e., 10%, 20%, 30%, 40% and 50%) on the air flow dataset. The running time for them are summed up to be the total running time. This process is repeated 10 times. The average total running time for the 10 repeats is shown in the y-axis. Values are given as mean ± standard deviation. Note, the standard deviation is tiny so that it can hardly been seen in the figure.

Similar articles

Cited by

References

    1. Shannon C.E., Weaver W. The Mathematical Theory of Information. Math. Gazette. 1949;97:170–180.
    1. Ivanov P.C., Amaral L.A., Goldberger A.L., Havlin S., Rosenblum M.G., Struzik Z.R., Stanley H.E. Multifractality in Human Heartbeat Dynamics. Nature. 1999;399:461. doi: 10.1038/20924. - DOI - PubMed
    1. Chen C., Jin Y., Lo I.L., Zhao H., Sun B., Zhao Q., Zheng J., Zhang X.D. Complexity Change in Cardiovascular Disease. Int. J. Biol. Sci. 2017;13:1320–1328. doi: 10.7150/ijbs.19462. - DOI - PMC - PubMed
    1. Henriques T.S., Costa M.D., Mathur P., Mathur P., Davis R.B., Mittleman M.A., Khabbaz K.R., Goldberger A.L., Subramaniam B. Complexity of Preoperative Blood Pressure Dynamics: Possible Utility in Cardiac Surgical Risk Assessment. J. Clin. Monit. Comput. 2018;33:31–38. doi: 10.1007/s10877-018-0133-4. - DOI - PMC - PubMed
    1. Kohnert K.D., Heinke P., Vogt L., Augstein P., Thomas A., Salzsieder E. Associations of Blood Glucose Dynamics with Antihyperglycemic Treatment and Glycemic Variability in Type 1 and Type 2 Diabetes. J. Endocrinol. Investig. 2017;40:1201–1207. doi: 10.1007/s40618-017-0682-2. - DOI - PubMed

LinkOut - more resources