
Prefer NVENC for WebRTC H264 encoding#2329

Open
balthazur wants to merge 8 commits into main from codex/h264-nvenc-webrtc

Conversation

@balthazur
Contributor

balthazur commented May 12, 2026

Summary

  • Prefer NVIDIA h264_nvenc for aiortc H.264 WebRTC encoding when available.
  • Fall back to the existing libx264 encoder when NVENC is unavailable or fails.
  • Add scoped [WEBRTC_NVENC] logs for patch activation, encoder selection, fallback, and resolution-triggered encoder recreation.
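The resolution-triggered encoder recreation mentioned above follows a common caching pattern: remember the dimensions the encoder was opened with, and rebuild it when the incoming frame size changes. A minimal stdlib-only sketch of that pattern (class and parameter names are illustrative, not the PR's actual code):

```python
class EncoderCache:
    """Recreate the underlying encoder whenever the frame size changes."""

    def __init__(self, open_encoder):
        # open_encoder: callable (width, height) -> encoder instance
        self._open_encoder = open_encoder
        self._encoder = None
        self._size = None

    def get(self, width, height):
        # Reopen only when no encoder exists or the resolution changed.
        if self._encoder is None or self._size != (width, height):
            self._encoder = self._open_encoder(width, height)
            self._size = (width, height)
        return self._encoder
```

In the real patch this is where an NVENC `av.CodecContext` would be (re)opened; the sketch only shows the cache-and-invalidate shape.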

Why

Modal GPU workers run on NVIDIA GPUs, so H.264 hardware encoding can reduce server-side encode latency without changing the negotiated browser codec. H.264 stays the WebRTC-compatible path; this does not introduce H.265/HEVC or bitrate/congestion-control changes.

Testing

  • python3 -m py_compile inference/core/interfaces/webrtc_worker/h264_nvenc.py inference/core/interfaces/webrtc_worker/webrtc.py inference/core/interfaces/stream_manager/manager_app/webrtc.py
  • Local smoke test with repo venv and aiortc 1.14.0: confirmed h264_nvenc fallback to libx264 on a non-NVIDIA Mac still produces H.264 packets.
  • git diff --check

Staging benchmark

| Encoder | Hardware | 1080p encode time | Relative speed | Notes |
| --- | --- | --- | --- | --- |
| libx264 | CPU | ~14-17 ms/frame steady-state | baseline | Works everywhere, but uses CPU and can become expensive at higher resolution/bitrate. |
| h264_nvenc | NVIDIA GPU | avg ~2.6 ms/frame, min ~2.1 ms, max ~3.1 ms | ~5-6x faster | Uses the dedicated NVIDIA video encoder on T4/L4/L40S. Frees CPU and gives much more headroom for 1080p WebRTC output. |

In staging, software libx264 encoding took roughly 14-17 ms per 1080p frame. With h264_nvenc, sampled encode time dropped to about 2.6 ms per frame, roughly a 5-6x improvement. This does not by itself solve all WebRTC congestion/ramp-up behavior, but it removes server-side H.264 encoding as a major bottleneck on NVIDIA GPU Modal workers.
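Per-frame encode times like those above can be sampled with a simple wall-clock harness around the encode call. This is a generic sketch, not the benchmark code used in staging; `encode_frame` and the warmup count are assumptions:

```python
import statistics
import time


def sample_encode_times(encode_frame, frames, warmup=5):
    """Time encode_frame over frames and return (avg_ms, min_ms, max_ms)
    for steady-state samples, skipping the first `warmup` iterations."""
    samples_ms = []
    for i, frame in enumerate(frames):
        start = time.perf_counter()
        encode_frame(frame)
        elapsed_ms = (time.perf_counter() - start) * 1000.0
        # Early encodes include one-off context/allocation overhead.
        if i >= warmup:
            samples_ms.append(elapsed_ms)
    return statistics.mean(samples_ms), min(samples_ms), max(samples_ms)
```

Skipping warmup frames matters here because the first NVENC encode pays the cost of opening the hardware context, which would skew the average.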

For comparison, I also tested VP8: backend VP8 encode sampled around 10-17 ms/frame, while browser-side encode/decode stayed in the same low-ms range as H.264, so VP8 did not look like the better backend output codec.


Note

Medium Risk
Monkey-patches aiortc's H264Encoder to prefer GPU NVENC, which can affect WebRTC video stability/compatibility and may surface runtime/driver-specific failures despite the fallback path.

Overview
Prefers NVIDIA NVENC for WebRTC H.264 video encoding when available.

Adds h264_nvenc support via a runtime patch (prefer_h264_nvenc_encoder) that overrides aiortc.codecs.h264.H264Encoder._encode_frame. The patched method tries to open an NVENC av.CodecContext, updates the bitrate, recreates the encoder on resolution change, and falls back to the original libx264 path when NVENC is unavailable or an encode error occurs, emitting [WEBRTC_NVENC] scoped logs throughout.

Enables this behavior by invoking prefer_h264_nvenc_encoder() at import time in both the stream manager WebRTC app and the WebRTC worker entrypoints.
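The patch-and-fall-back shape described above can be illustrated in isolation. This is a minimal stdlib-only sketch of the pattern; the `H264Encoder` stand-in and `prefer_hw_encoder` helper are placeholders, not aiortc's real API:

```python
import logging

logger = logging.getLogger("WEBRTC_NVENC")


class H264Encoder:
    """Stand-in for the encoder class being patched (placeholder, not aiortc)."""

    def _encode_frame(self, frame, force_keyframe):
        return ["libx264-packet"]  # pretend software-encoded output


def prefer_hw_encoder(encode_hw):
    """Monkey-patch H264Encoder._encode_frame to try a hardware encode first,
    falling back to the original software path on any failure."""
    original = H264Encoder._encode_frame

    def patched(self, frame, force_keyframe):
        try:
            return encode_hw(frame, force_keyframe)
        except Exception as exc:
            # Keep the stream alive: log and use the original encoder.
            logger.warning("[WEBRTC_NVENC] falling back to software: %s", exc)
            return original(self, frame, force_keyframe)

    H264Encoder._encode_frame = patched
```

Capturing `original` before assignment is what makes the fallback safe: even after patching, the software path remains reachable on every frame, so a transient NVENC failure degrades performance rather than breaking the track.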

Reviewed by Cursor Bugbot for commit 8805940.

@cursor cursor Bot left a comment

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Reviewed by Cursor Bugbot for commit 6a9096f.

Comment thread inference/core/interfaces/webrtc_worker/h264_nvenc.py Outdated
@grzegorz-roboflow
Collaborator

Let's measure this; the CPU->GPU->CPU round trip might actually make this slower.

@balthazur
Contributor Author

Let's measure this; the CPU->GPU->CPU round trip might actually make this slower.

@grzegorz-roboflow what exactly do you want to measure? So far I've measured the raw encoding time. Let me know and I can run a comparison.

