feat: M2 zero-copy foundation — EGL→CUDA import + NVENC CUDA-frame path · 16a00563a8 - punktfunk - unom

feat: M2 zero-copy foundation — EGL→CUDA import + NVENC CUDA-frame path

Scaffolding for dmabuf zero-copy (plan §9), opt-in via LUMEN_ZEROCOPY:

- src/zerocopy/{cuda,egl}.rs: hand-rolled CUDA Driver-API FFI (no Rust crate
  exposes the EGL-interop calls / CUeglFrame) with a shared process-wide
  CUcontext + pitched device buffers; an EGL importer (GBM platform on the
  NVIDIA render node) that turns a dmabuf into an EGLImage, registers it with
  CUDA, and copies it device-to-device into an owned buffer. `zerocopy-probe`
  subcommand validates the FFI/linking/GPU access — confirmed on the box
  (driver 595, EGL_EXT_image_dma_buf_import + modifiers).
- CapturedFrame gains a FramePayload enum (Cpu(Vec<u8>) | Cuda(DeviceBuffer));
  the encoder branches: CPU keeps the expand+upload path, CUDA wraps the device
  buffer in an AV_PIX_FMT_CUDA frame fed straight to hevc_nvenc (sharing our
  CUcontext via a hand-declared AVCUDADeviceContext, since ffmpeg-sys doesn't
  bind hwcontext_cuda.h). open_video/the encoder take a `cuda` flag derived from
  the first frame's payload.

The capture-side dmabuf negotiation (which produces the Cuda frames) is the
next step; the CPU path is unchanged and remains the default + fallback. Builds
clean, clippy clean, tests pass.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

This commit is contained in:

Enrico Bühler

2026-06-09 15:13:05 +00:00

parent b64be1dc33

commit 16a00563a8

12 changed files with 777 additions and 70 deletions

									
										crates/lumen-host/src/gamestream/stream.rs
									
		+1
		
												View File
												
				@@ -121,6 +121,7 @@ fn stream_body(

				        frame.height,

				        cfg.fps,

				        cfg.bitrate_kbps as u64 * 1000,

				        frame.is_cuda(),

				    )

				    .context("open NVENC for stream")?;

				    // FEC overhead percent (Sunshine default 20). Override with LUMEN_FEC_PCT (0 = data-only).