Diving deeper into the rPPG components¶

This guide walks you through the inner workings of the Rppg.process_frame method.

In [1]:

Copied!





import matplotlib.patches
import matplotlib.pyplot as plt
import numpy as np

import yarppg

filename = "tests/testvideo_30fps.mp4"
fps = yarppg.helpers.get_video_fps(filename)
import matplotlib.patches
import matplotlib.pyplot as plt
import numpy as np

import yarppg

filename = "tests/testvideo_30fps.mp4"
fps = yarppg.helpers.get_video_fps(filename)

Overview¶

Remote Photoplethysmography (rPPG) typically involves three steps:

region of interest (ROI) identification
signal extraction
heart rate estimation

The yarppg.Rppg class combines all these steps in a convenient manner. By default, yarppg.Rppg() will give you a simple processor, that finds the lower part of a face and extracts the average green channel within the region.

In [2]:

Copied!

rppg = yarppg.Rppg()
rppg = yarppg.Rppg()

If you are only interested in the results, you can do something like this:

In [3]:

Copied!





results: list[yarppg.RppgResult] = []
for frame in yarppg.frames_from_video(filename):
    results.append(rppg.process_frame(frame))
plt.plot(np.array(results)[:, 0])  # plot rPPG signal
results: list[yarppg.RppgResult] = []
for frame in yarppg.frames_from_video(filename):
    results.append(rppg.process_frame(frame))
plt.plot(np.array(results)[:, 0])  # plot rPPG signal

Out[3]:

[<matplotlib.lines.Line2D at 0x25e40497350>]

No description has been provided for this image

Under the hood, rppg.process_frame calls

a roi detector's detect method
a signal extracor's (yarppg.Processor) process
a HrCalculators update method.

Let's define each component separately.

Region of interest detection¶

yarPPG comes with several different implementations of ROI detectors. By default, we use an AI-based face landmarker provided through Google's MediaPipe. The FaceMeshDetector applies the face landmarker model and extracts the region of the lower face, as is done by Li et al. (2014).

X. Li, J. Chen, G. Zhao, and M. Pietikainen, “Remote Heart Rate Measurement From Face Videos Under Realistic Situations”, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4264-4271, 2014 doi:10.1109/CVPR.2014.543

We can visualize the ROI mask, which is > 0 for each pixel of the ROI. Some segmenters, including the FaceMeshDetector, also return the bounding box of the detected face. We mark the bounding box as a red rectangle below.

In [4]:

Copied!





roi_detector = yarppg.FaceMeshDetector()

frame = next(yarppg.frames_from_video(filename))
roi = roi_detector.detect(frame)
plt.imshow(roi.mask > 0, cmap="Greys_r", aspect="auto")

assert (
    roi.face_rect is not None
)  # FaceMeshDetector also provides a bounding box.
x, y, w, h = roi.face_rect
rect = matplotlib.patches.Rectangle(
    (x, y), w, h, edgecolor="r", facecolor="none"
)
plt.gca().add_patch(rect)
plt.axis("off")
roi_detector = yarppg.FaceMeshDetector()

frame = next(yarppg.frames_from_video(filename))
roi = roi_detector.detect(frame)
plt.imshow(roi.mask > 0, cmap="Greys_r", aspect="auto")

assert (
    roi.face_rect is not None
)  # FaceMeshDetector also provides a bounding box.
x, y, w, h = roi.face_rect
rect = matplotlib.patches.Rectangle(
    (x, y), w, h, edgecolor="r", facecolor="none"
)
plt.gca().add_patch(rect)
plt.axis("off")

Out[4]:

(np.float64(-0.5), np.float64(1919.5), np.float64(1079.5), np.float64(-0.5))

Signal extraction¶

The default signal extractor (yarppg.Processor) simply calculates the average green channel within the region of interest. This can already be enough to estimate heart rate accurately, if there is no movement and no lighting changes.

The processor returns an RppgResult container, which includes some additional information besides the extracted value. For example, all processors return the mean color (R, G and B) of the ROI, regardless of the specific algorithm.

In [5]:

Copied!

processor = yarppg.Processor()
result = processor.process(roi)
print(result.value == result.roi_mean.g)
processor = yarppg.Processor()
result = processor.process(roi)
print(result.value == result.roi_mean.g)

True

Heart rate estimation¶

In order to perform heart rate estimation, we need to look at the rPPG signal over time. The yarppg.HrCalculator keeps an internal buffer of recent signal values and periodically updates the heart rate estimate. The PeakBasedHrCalculator identifies peaks in the rPPG signal and calculates heart rate from the average distance of peaks within the buffer window.

In your signal processing loop, you can call the update method in every iteration. The calculator will decide based on the update_interval attribute, whether to perform the calculation or to simply store the value in the buffer. Below, we set up the calculator to produce a new HR estimate with every 15th frame.

In [6]:

Copied!

hrcalc = yarppg.PeakBasedHrCalculator(
    fs=30, window_seconds=5, distance=0.6, update_interval=15
)
hrcalc = yarppg.PeakBasedHrCalculator(
    fs=30, window_seconds=5, distance=0.6, update_interval=15
)

We can see the internal buffer growing, when repeatedly calling update. Note that HR will be nan as long as the buffer is smaller than the expected window size. To clear the buffer, we call hrcalc.reset().

In [7]:

Copied!





for res in results[:5]:
    hr = hrcalc.update(res.value)
    print("Buffer lenght:", len(hrcalc.values), "HR:", hr)
hrcalc.reset()
print("Buffer lenght:", len(hrcalc.values), "- state cleared.")
for res in results[:5]:
    hr = hrcalc.update(res.value)
    print("Buffer lenght:", len(hrcalc.values), "HR:", hr)
hrcalc.reset()
print("Buffer lenght:", len(hrcalc.values), "- state cleared.")

Buffer lenght: 1 HR: nan
Buffer lenght: 2 HR: nan
Buffer lenght: 3 HR: nan
Buffer lenght: 4 HR: nan
Buffer lenght: 5 HR: nan
Buffer lenght: 0 - state cleared.

Putting everything together¶

We can combine all of the above tools to build a fully customizable and extendable rPPG processing loop (see the [yarppg.ui.simplest][yarppg.ui.simplest] loop for an equivalent implementation with a simplistic UI.)

In [8]:

Copied!





# Clear the previous state.
processor.reset()
hrcalc.reset()

results: list[yarppg.RppgResult] = []
for i, frame in enumerate(yarppg.frames_from_video(filename)):
    roi = roi_detector.detect(frame)
    result = processor.process(roi)
    result.hr = hrcalc.update(result.value)

    results.append(result)
    if i % 30 == 0:
        print(
            f"{i=} {(roi.mask > 0).mean()=:.1%} {result.value=:.2f}"
            f" {result.hr=:.2f}"
        )

plt.plot(np.array(results)[:, 0])
# Clear the previous state.
processor.reset()
hrcalc.reset()

results: list[yarppg.RppgResult] = []
for i, frame in enumerate(yarppg.frames_from_video(filename)):
    roi = roi_detector.detect(frame)
    result = processor.process(roi)
    result.hr = hrcalc.update(result.value)

    results.append(result)
    if i % 30 == 0:
        print(
            f"{i=} {(roi.mask > 0).mean()=:.1%} {result.value=:.2f}"
            f" {result.hr=:.2f}"
        )

plt.plot(np.array(results)[:, 0])

i=0 (roi.mask > 0).mean()=3.6% result.value=120.17 result.hr=nan

i=30 (roi.mask > 0).mean()=3.9% result.value=121.97 result.hr=nan

i=60 (roi.mask > 0).mean()=3.8% result.value=122.37 result.hr=nan

i=90 (roi.mask > 0).mean()=3.9% result.value=122.00 result.hr=nan

i=120 (roi.mask > 0).mean()=3.9% result.value=122.29 result.hr=nan

i=150 (roi.mask > 0).mean()=3.9% result.value=122.70 result.hr=23.67

i=180 (roi.mask > 0).mean()=3.9% result.value=121.92 result.hr=25.20

i=210 (roi.mask > 0).mean()=3.8% result.value=121.19 result.hr=23.40

i=240 (roi.mask > 0).mean()=3.9% result.value=121.87 result.hr=25.80

i=270 (roi.mask > 0).mean()=3.8% result.value=122.05 result.hr=26.60

Out[8]:

[<matplotlib.lines.Line2D at 0x25e405f9750>]