Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 12;19(14):3079.
doi: 10.3390/s19143079.

Deep PPG: Large-Scale Heart Rate Estimation with Convolutional Neural Networks

Affiliations

Deep PPG: Large-Scale Heart Rate Estimation with Convolutional Neural Networks

Attila Reiss et al. Sensors (Basel). .

Abstract

Photoplethysmography (PPG)-based continuous heart rate monitoring is essential in a number of domains, e.g., for healthcare or fitness applications. Recently, methods based on time-frequency spectra emerged to address the challenges of motion artefact compensation. However, existing approaches are highly parametrised and optimised for specific scenarios of small, public datasets. We address this fragmentation by contributing research into the robustness and generalisation capabilities of PPG-based heart rate estimation approaches. First, we introduce a novel large-scale dataset (called PPG-DaLiA), including a wide range of activities performed under close to real-life conditions. Second, we extend a state-of-the-art algorithm, significantly improving its performance on several datasets. Third, we introduce deep learning to this domain, and investigate various convolutional neural network architectures. Our end-to-end learning approach takes the time-frequency spectra of synchronised PPG- and accelerometer-signals as input, and provides the estimated heart rate as output. Finally, we compare the novel deep learning approach to classical methods, performing evaluation on four public datasets. We show that on large datasets the deep learning model significantly outperforms other methods: The mean absolute error could be reduced by 31 % on the new dataset PPG-DaLiA, and by 21 % on the dataset WESAD.

Keywords: CNN; PPG; dataset; deep learning; evaluation methods; heart rate; time-frequency spectrum.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Time-frequency spectra of PPG-signals, derived from the IEEE datasets. Left plot: IEEE_Training dataset, session #8. Right plot: IEEE_Test dataset, session #6 (subject 5, exercise type 2).
Figure 2
Figure 2
Data collection protocol: example recording of subject S7. X-axis: activities and transient periods performed during the protocol. Y-axis: heart rate, extracted from the ECG signal.
Figure 3
Figure 3
Example ECG-signal snippet from the data recording of subject S1. The two encircled R-peaks were falsely identified by the R-peak detector, and were thus manually removed during heart rate ground truth generation.
Figure 4
Figure 4
IEEE_Training dataset, session S10: comparison of ground truth (HRref) and estimated (HRest) heart rate. Left plot: heart rate estimation with the SpaMa-approach, right plot: heart rate estimation with the SpaMaPlus-approach.
Figure 5
Figure 5
Proposed CNN-architecture with NL=18 convolution-maxpool layers. Ntr refers to the number of segments used together for heart rate tracking (see Section 4.1 for details). N depends on NL. Input: Ntr×Nch×NFFT matrix. Output: heart rate [bpm].
Figure 6
Figure 6
Evaluation results on IEEE_Training (left) and IEEE_Test (right) datasets, investigating the number of convolutional layers and ensemble prediction. LOSO results of the classical methods displayed for reference. NL refers to the hyperparameter defining the number of convolution-pooling layers, see Section 4.1.
Figure 7
Figure 7
Heart rate estimation on the entire session S7 of the dataset PPG-DaLiA: Ground truth vs. prediction based on the SpaMaPlus approach vs. prediction based on our deep learning model (CNN ensemble). Activity labels are displayed in the lower subplot of the figure.
Figure 8
Figure 8
Heart rate estimation on the cycling part of session S7 of the dataset PPG-DaLiA: Ground truth vs. prediction based on the SpaMaPlus approach vs. prediction based on our deep learning model (CNN ensemble). Activity labels are displayed in the lower subplot of the figure.
Figure 9
Figure 9
Proposed resource-constrained CNN-model. Input: 257×4 matrix (time-frequency spectra of the PPG-signal and three acceleration channels). Output: heart rate [bpm].

References

    1. Apple Apple Watch Series Official Website. [(accessed on 21 May 2019)];2019 Available online: https://www.apple.com/lae/watch/
    1. Fitbit Fitbit Charge 3. [(accessed on 21 May 2019)];2018 Available online: https://www.fitbit.com/charge3.
    1. Samsung Samsung Simband Official Website. [(accessed on 21 May 2019)];2018 Available online: https://www.simband.io/
    1. Cup I.S.P. Heart Rate Monitoring During Physical Exercise using Wrist-Type Photoplethysmographic (PPG) Signals. [(accessed on 21 May 2019)];2015 Available online: https://sites.google.com/site/researchbyzhang/ieeespcup2015. - PubMed
    1. Zhang Z. Photoplethysmography-based Heart Rate Monitoring in Physical Activities via Joint Sparse Spectrum Reconstruction. IEEE Trans. Biomed. Eng. 2015;62:1902–1910. doi: 10.1109/TBME.2015.2406332. - DOI - PubMed

LinkOut - more resources