DiffPhys: Enhancing Signal-to-Noise Ratio in Remote Photoplethysmography Signal Using a Diffusion Model Approach

Bioengineering (Basel). 2024 Jul 23;11(8):743. doi: 10.3390/bioengineering11080743.

Abstract

Remote photoplethysmography (rPPG) is an emerging non-contact method for monitoring cardiovascular health based on facial videos. The quality of the captured videos largely determines the efficacy of rPPG in this application. Traditional rPPG techniques, while effective for heart rate (HR) estimation, often produce signals with an inadequate signal-to-noise ratio (SNR) for reliable vital sign measurement due to artifacts like head motion and measurement noise. Another pivotal factor is the overlooking of the inherent properties of signals generated by rPPG (rPPG-signals). To address these limitations, we introduce DiffPhys, a novel deep generative model particularly designed to enhance the SNR of rPPG-signals. DiffPhys leverages the conditional diffusion model to learn the distribution of rPPG-signals and uses a refined reverse process to generate rPPG-signals with a higher SNR. Experimental results demonstrate that DiffPhys elevates the SNR of rPPG-signals across within-database and cross-database scenarios, facilitating the extraction of cardiovascular metrics such as HR and HRV with greater precision. This enhancement allows for more accurate monitoring of health conditions in non-clinical settings.

Keywords: deep learning; diffusion model; remote photoplethysmography; vital signs measurement.

Grants and funding

This research is partially supported by the Innovation and Technology Fund (ITF) Technology Start-up Support Scheme for Universities (TSSSU+), Research Talent Hub for Incubatees and I&T Tenants of the HKSTPC and the Cyberport (RTH-SPC), HKSTP Incubation Program and HKUST Sports Science and Technology Research Grant (SSTRG).