Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Sep 20;21(18):6296.
doi: 10.3390/s21186296.

Deep Learning Methods for Remote Heart Rate Measurement: A Review and Future Research Agenda

Affiliations
Review

Deep Learning Methods for Remote Heart Rate Measurement: A Review and Future Research Agenda

Chun-Hong Cheng et al. Sensors (Basel). .

Abstract

Heart rate (HR) is one of the essential vital signs used to indicate the physiological health of the human body. While traditional HR monitors usually require contact with skin, remote photoplethysmography (rPPG) enables contactless HR monitoring by capturing subtle light changes of skin through a video camera. Given the vast potential of this technology in the future of digital healthcare, remote monitoring of physiological signals has gained significant traction in the research community. In recent years, the success of deep learning (DL) methods for image and video analysis has inspired researchers to apply such techniques to various parts of the remote physiological signal extraction pipeline. In this paper, we discuss several recent advances of DL-based methods specifically for remote HR measurement, categorizing them based on model architecture and application. We further detail relevant real-world applications of remote physiological monitoring and summarize various common resources used to accelerate related research progress. Lastly, we analyze the implications of research findings and discuss research gaps to guide future explorations.

Keywords: deep learning; heart rate measurement; noncontact monitoring; rPPG; remote photoplethysmography.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Principle of remote photoplethysmography (rPPG) based on the dichromatic reflection model (DRM). The digital camera captures the specular and diffuse reflection from ambient light. The specular reflection contains surface information that does not relate to physiological signals, while the diffuse reflection is modulated by blood flow. The rPPG signal can be obtained from further signal processing.
Figure 2
Figure 2
General framework of conventional methods for remote heart rate (HR) measurement. Face detection (e.g., Viola and Jones algorithm) is performed on the video frames, resulting in the red bounding box on the face. Next, regions of interest (ROIs) such as the cheeks marked by the black boxes are selected within the face box. The rPPG signal is extracted from the pixels within the ROIs. Lastly, post-processing techniques, such as frequency analysis (e.g., Fourier transform) and peak detection, are applied on the extracted signal to estimate HR.
Figure 3
Figure 3
Schematic diagram of end-to-end deep learning (DL) methods and hybrid DL methods. End-to-end DL methods directly output the HR or rPPG signal with a single model, while hybrid DL methods utilize DL techniques at various stages.
Figure 4
Figure 4
Architecture of DeepPhys [27].
Figure 5
Figure 5
Architecture of 3D CNN PhysNet [29].
Figure 6
Figure 6
Architecture used in Yu et al. [30].
Figure 7
Figure 7
Architecture of AutoHR [31].
Figure 8
Figure 8
Architecture of ETA-rPPGNet [34].
Figure 9
Figure 9
Architecture of RNN-based PhysNet [29].
Figure 10
Figure 10
Architecture used in Deep-HR [45] for signal optimization.
Figure 11
Figure 11
Architecture used in Bian et al. [47].
Figure 12
Figure 12
Architecture used in Deep-HR [45] for signal extraction.
Figure 13
Figure 13
Architecture of Siamese-rPPG [51].
Figure 14
Figure 14
Architecture of Meta-rPPG [57].
Figure 15
Figure 15
Architecture of PRNet [58].
Figure 16
Figure 16
Architecture of PulseGAN [59].
Figure 17
Figure 17
Architecture used in Yang et al. [64].
Figure 18
Figure 18
General procedure of constructing a spatio-temporal map. Firstly, the images are aligned and ROI selection is performed to obtain ROI images. Then, these ROI images are divided into several ROI blocks. Next, within each block, the average color value is calculated for each color channel. After that, the average color value of each channel at the same block but different frames are concatenated into temporal sequences. Finally, the temporal sequences of each block are placed into rows to form a spatio-temporal map.
Figure 19
Figure 19
Architecture of RhythmNet [68].
Figure 20
Figure 20
Architecture used in Huang et al. [56].

Similar articles

Cited by

References

    1. Jeong I., Finkelstein J. Introducing Contactless Blood Pressure Assessment Using a High Speed Video Camera. J. Med. Syst. 2016;40:1–10. doi: 10.1007/s10916-016-0439-z. - DOI - PubMed
    1. Bal U. Non-contact estimation of heart rate and oxygen saturation using ambient light. Biomed. Opt. Express. 2015;6:86–97. doi: 10.1364/BOE.6.000086. - DOI - PMC - PubMed
    1. Massaroni C., Nicolò A., Sacchetti M., Schena E. Contactless Methods For Measuring Respiratory Rate: A Review. IEEE Sens. J. 2021;21:12821–12839. doi: 10.1109/JSEN.2020.3023486. - DOI
    1. Iozzia L., Cerina L., Mainardi L. Relationships between heart-rate variability and pulse-rate variability obtained from video-PPG signal using ZCA. Physiol. Meas. 2016;37:1934–1944. doi: 10.1088/0967-3334/37/11/1934. - DOI - PubMed
    1. Scalise L. Non contact heart monitoring. Adv. Electrocardiogr. Methods Anal. 2012;4:81–106.

LinkOut - more resources