unom/punktfunk

enricobuehler 708c62788d

apple / swift (push) Successful in 57s
ci / rust (push) Successful in 1m39s
ci / web (push) Successful in 32s
ci / docs-site (push) Successful in 31s
android / android (push) Successful in 3m29s
windows-host / package (push) Successful in 3m39s
deb / build-publish (push) Successful in 3m7s
decky / build-publish (push) Successful in 22s
ci / bench (push) Successful in 4m43s
docker / build-push (., web/Dockerfile, punktfunk-web) (push) Successful in 16s
docker / build-push (ci, ci/fedora-rpm.Dockerfile, punktfunk-fedora-rpm) (push) Successful in 2m27s
docker / build-push (--build-arg FEDORA_VERSION=44, ci, ci/fedora-rpm.Dockerfile, punktfunk-fedora44-rpm) (push) Successful in 3m24s
docker / build-push (docs-site, docs-site/Dockerfile, punktfunk-docs) (push) Successful in 22s
docker / build-push (ci, ci/rust-ci.Dockerfile, punktfunk-rust-ci) (push) Successful in 2m18s
rpm / build-publish (bazzite, punktfunk-fedora-rpm) (push) Successful in 8m22s
docker / deploy-docs (push) Successful in 21s
rpm / build-publish (fedora-44, punktfunk-fedora44-rpm) (push) Successful in 7m53s
feat(host/encode): VAAPI zero-copy dmabuf import (AMD/Intel GPU CSC)

Phase 2 of AMD/Intel support: the VAAPI encoder now takes the capture dmabuf
directly and does the RGB->NV12 colour conversion on the GPU's video engine,
eliminating the host-side de-pad, swscale CSC, and upload that the CPU path
pays for.

- capture: adds a vendor-neutral FramePayload::Dmabuf (dup'd fd + fourcc/
  modifier/layout). When zero-copy is on, the EGL->CUDA importer is
  unavailable (any non-NVIDIA host), and the backend is VAAPI, the capturer
  advertises LINEAR dmabuf and hands the raw buffer to the encoder instead of
  CPU-copying it.
- encode/vaapi: the encoder self-configures from the first frame's payload (no
  open_video signature change). The dmabuf arm wraps the buffer as an
  AV_PIX_FMT_DRM_PRIME frame and pushes it through the filter graph
  buffer(drm_prime) -> hwmap(vaapi) -> scale_vaapi=nv12 -> buffersink; the
  encoder takes NV12 surfaces straight from the sink. The Phase 1 CPU-upload
  path is kept as the other arm (used when capture produces CPU frames).
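
The two bullets above can be read as a pair of decisions: which path the
capturer takes, and which arm the encoder configures from its first frame. The
following is a minimal sketch of that logic, not the code in this commit. The
names FramePayload::Cpu, Backend, CapturePath, EncoderArm,
choose_capture_path and arm_for_first_frame are hypothetical, the payload
fields are guessed from "dup'd fd + fourcc/modifier/layout", and the fallback
to a CPU copy when zero-copy is off is an assumption. The filter string uses
FFmpeg's hwmap derive_device and scale_vaapi format options as one plausible
spelling of the middle of the graph.

```rust
/// DRM fourcc packing: four ASCII bytes, little-endian (as in drm_fourcc.h).
pub const fn fourcc(a: u8, b: u8, c: u8, d: u8) -> u32 {
    (a as u32) | ((b as u32) << 8) | ((c as u32) << 16) | ((d as u32) << 24)
}

/// 32-bit xRGB, a common desktop capture format (used here only as an example).
pub const DRM_FORMAT_XRGB8888: u32 = fourcc(b'X', b'R', b'2', b'4');
/// The LINEAR modifier: plain row-major layout, no vendor tiling.
pub const DRM_FORMAT_MOD_LINEAR: u64 = 0;

/// Hypothetical mirror of the payload described above; field names are assumed.
pub enum FramePayload {
    /// Pixels already copied into host memory.
    Cpu(Vec<u8>),
    /// A dmabuf handed over as-is. `fd` is a plain integer in this sketch;
    /// real code would more likely hold an owned, dup'd descriptor.
    Dmabuf { fd: i32, fourcc: u32, modifier: u64, offset: u32, stride: u32 },
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Backend { Nvenc, Vaapi, Software }

#[derive(Debug, PartialEq, Eq)]
pub enum CapturePath { CudaImport, DmabufPassthrough, CpuCopy }

/// Capture-side decision: passthrough only when zero-copy is on, the
/// EGL->CUDA importer is unavailable, and the backend is VAAPI.
pub fn choose_capture_path(zero_copy: bool, cuda_importer: bool, backend: Backend) -> CapturePath {
    match (zero_copy, cuda_importer, backend) {
        (false, _, _) => CapturePath::CpuCopy, // assumed fallback
        (true, true, _) => CapturePath::CudaImport, // NVIDIA path, unchanged
        (true, false, Backend::Vaapi) => CapturePath::DmabufPassthrough,
        (true, false, _) => CapturePath::CpuCopy, // assumed fallback
    }
}

#[derive(Debug, PartialEq, Eq)]
pub enum EncoderArm { CpuUpload, DmabufFilterGraph }

/// Encoder-side decision: configure from the first frame's payload, so the
/// open call itself needs no extra parameter.
pub fn arm_for_first_frame(first: &FramePayload) -> EncoderArm {
    match first {
        FramePayload::Cpu(_) => EncoderArm::CpuUpload,
        FramePayload::Dmabuf { .. } => EncoderArm::DmabufFilterGraph,
    }
}

/// Assumed textual form of the stages between the buffer source and the sink.
pub const DMABUF_FILTER_CHAIN: &str = "hwmap=derive_device=vaapi,scale_vaapi=format=nv12";
```

Under these assumptions, choose_capture_path(true, false, Backend::Vaapi)
yields DmabufPassthrough, and a first frame carrying FramePayload::Dmabuf
selects the filter-graph arm.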

Live-validated on a Radeon 780M (real Sway/xdpw desktop capture): correct,
pixel-perfect HEVC, and ~10x less host CPU at 1440p (4.2s -> 0.4s of CPU for
300 frames) -- the de-pad/CSC/upload moves to the GPU. NVIDIA unchanged
(zero-copy still imports to CUDA; the passthrough path only engages on
non-NVIDIA hosts).
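
For scale, the CPU figures quoted above can be converted to a per-frame cost.
This is a small arithmetic sketch using only the numbers in this message; the
helper names cpu_ms_per_frame and reduction_factor are made up for
illustration.

```rust
/// Average host CPU time per frame, in milliseconds.
pub fn cpu_ms_per_frame(total_cpu_secs: f64, frames: u32) -> f64 {
    total_cpu_secs * 1000.0 / f64::from(frames)
}

/// How many times less CPU the new path uses than the old one.
pub fn reduction_factor(before_secs: f64, after_secs: f64) -> f64 {
    before_secs / after_secs
}
```

With 4.2 s and 0.4 s over 300 frames, this gives about 14 ms of CPU per frame
before and about 1.3 ms after, a factor of about 10.5.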

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-20 09:57:00 +00:00

punktfunk-core

refactor: drop milestone names + consolidate clients; loss-recovery & rumble fixes

2026-06-18 21:05:58 +00:00

punktfunk-host

feat(host/encode): VAAPI zero-copy dmabuf import (AMD/Intel GPU CSC)

2026-06-20 09:57:00 +00:00