Files
punktfunk/docs-site/content/docs/status.md
T
enricobuehler 7b10714b62
ci / web (push) Failing after 38s
ci / rust (push) Successful in 53s
docker / build-push (., web/Dockerfile, punktfunk-web) (push) Successful in 3s
docker / build-push (ci, ci/rust-ci.Dockerfile, punktfunk-rust-ci) (push) Successful in 4s
docker / build-push (docs-site, docs-site/Dockerfile, punktfunk-docs) (push) Successful in 16s
ci / docs-site (push) Failing after 39s
docker / deploy-docs (push) Successful in 16s
apple / swift (push) Successful in 1m17s
feat(apple): stage-2 presenter — explicit decode + Metal present + glass-to-glass
Opt-in (Settings -> Presenter; `punktfunk.presenter`, default stage-1). Stage-1's
AVSampleBufferDisplayLayer decodes AND presents internally with no per-frame
callback, so neither decode nor present can be stamped or hand-paced. Stage-2
takes explicit control:

- VideoDecoder: VTDecompressionSession, async output callback stamps
  decode-completion, session rebuilt on every IDR / format change. Unit-tested
  (testVideoDecoderAsyncCallbackDeliversPixels).
- MetalVideoPresenter: CAMetalLayer + CVMetalTextureCache + a runtime-compiled
  BT.709 limited-range NV12->RGB shader, present at the next vsync. The
  CVMetalTextures + pixel buffer are held until the GPU completes.
- Stage2Pipeline: pump thread -> decoder -> newest-ready 1-slot ring; the hosting
  view's display link drains it once per vsync and stamps capture->present
  (the display-link target time projected into CLOCK_REALTIME).
- LatencyMeter gains record(ptsNs:atNs:offsetNs:); the HUD shows a capture->present
  (glass-to-glass, modulo host render->capture) line, skew-corrected via
  clockOffsetNs. Measured live ~11 ms p50 vs ~2.2 ms capture->client.
- StreamView / StreamViewIOS host the CAMetalLayer as a sublayer + a CADisplayLink
  (NSView.displayLink on macOS) when stage-2; input capture + HUD unchanged. The
  session-active gates switch from `pump != nil` to `connection != nil` so capture
  engages without a StreamPump.

Validated: builds macOS/iOS/tvOS; the decode half is unit-tested; the Metal
present is live-validated on glass (correct image + the capture->present number).
Colorspace is BT.709 SDR for now; 10-bit/HDR + a pacing policy are later.
Plan: docs-site/content/docs/apple-stage2-presenter.md.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
2026-06-12 15:29:23 +02:00

5.9 KiB
Raw Blame History

title, description
title description
Status & Progress Where the work stands, what's live on each box, and a running progress log.

The living progress tracker. Milestone-level status lives in CLAUDE.md and the design in the Implementation Plan; this page is the current state + a dated log of what landed, kept up to date as work happens. Newest first.

Milestones at a glance

Milestone State
M1punktfunk-core + C ABI (protocol · FEC · crypto) complete & hardened
M2 — GameStream host (Moonlight-compatible) working end-to-end; HDR/surround-audio polish open
M3punktfunk/1 native protocol (QUIC control + UDP data) full session planes, validated live
M4 — native client decode + present (Apple first) 🟡 stage 1 live; stage-2 presenter built + decode-tested (opt-in, present needs live validation)

Live on the boxes

Box Role Compositor Notes
home-worker-2 (dev) KDE/KWin appliance kwin (headless Plasma) QEMU VM, passthrough RTX 5070 Ti; serve --native user unit
home-worker-3 (GNOME) GNOME/Mutter appliance mutter (RecordVirtual) RTX 4090; autologin GNOME Wayland; serve --native user unit. See GNOME Box Setup
home-bazzite-1 SteamOS-like host gamescope host-managed Steam session at client mode. See Bazzite Setup

All three appliances advertise over mDNS (_punktfunk._udp) and require PIN pairing by default.

Progress log

2026-06-12

  • CI + deployment landed (see the CI & Docker guide). Gitea Actions, three workflows: Rust workspace checks inside the new punktfunk-rust-ci builder image (Ubuntu 26.04, full link-dep stack incl. a libcuda stub — 141/141 tests green in-container), web + docs-site build/typecheck, docker.yml building+pushing punktfunk-web/punktfunk-docs/punktfunk-rust-ci to the registry, and apple.yml (xcframework → swift build/swift test) on a new host-mode macOS runner (home-mac-mini-1, provisioned by scripts/ci/setup-macos-runner.sh; macOS Local-Network privacy forces it to run as a root LaunchDaemon). Host and native clients stay un-dockerized by design. This site now deploys automatically: deploy-docs ships it to unom-1:3220, Caddy serves https://docs.punktfunk.unom.io — live and verified.
  • Concurrent sessions — the host no longer serves one client at a time. The accept loop spawns each session (JoinSet), bounded by --max-concurrent (default 4, a NVENC bound; overflow waits in the accept queue). Each session keeps its own virtual output + encoder; they share the host-lifetime input/audio/mic services — i.e. multiple devices viewing/controlling the same desktop on kwin/mutter/wlroots. Validated live on the GNOME box: two clients connected at once → two independent Mutter virtual outputs (1280×720 + 1920×1080) streaming simultaneously (39 MB + 48 MB). gamescope's independent-desktops (multi-user) isolation — per-session input/audio — is a follow-up.
  • Apple client latency HUDPunktfunkConnection.clockOffsetNs (from the C-ABI getter) + LatencyMeter surface a skew-corrected capture→client-receipt p50/p95 in the macOS HUD: the first cross-machine latency the real Apple client reports. (Stage-1 AVSampleBufferDisplayLayer has no present callback, so decode→present is excluded — that needs the stage-2 presenter.) Needs an xcframework rebuild + swift test on the Mac to validate.
  • Skew handshake in the connector + C ABIquic::clock_sync is now a shared core helper used by both the reference client and NativeClient; the connector runs it at connect and exposes the host clock offset over the C ABI (punktfunk_connection_clock_offset_ns). This is the substrate the Apple client needs for the decode→present (glass-to-glass) term.
  • Wall-clock skew handshake (ClockProbe/ClockEcho, 8 NTP rounds after Start) — makes the client's capture→reassembled latency valid cross-machine. Validated GNOME box → dev box: offset 1.57 ms removed, p50 1.30 ms skew-corrected. (05bc9ab)
  • Native LAN auto-discovery — host advertises _punktfunk._udp (TXT: fingerprint, pairing, proto); punktfunk-client-rs --discover lists hosts. Validated cross-LAN. (4fff464)
  • Third test box stood up — home-worker-3 (Ubuntu 26.04, RTX 4090, GNOME 50): first GNOME/Mutter zero-copy streaming on a real desktop; 1 Gbps probe clean (625 MB/5 s, send_dropped=0). Two physical-NVIDIA gotchas documented in GNOME Box Setup.
  • Encode|send thread split validated on real NIC (send_dropped=0 at 720p60 / 1080p120). (b295a5b)

Earlier (see roadmap + git log)

  • 1 Gbps data plane: batched sendmmsg/recvmmsg + microburst-cap paced send thread.
  • Boot appliance: headless KDE session + host systemd units (no login).
  • Speed test + settable bitrate: negotiation + bandwidth probe (host side).
  • DualSense UHID + haptics; gamepads live; mic uplink; AV1 + surround (unit/live-capture tested).

In flight / next

See the Roadmap for the ordered list. Near-term:

  • True glass-to-glass: Apple client present-stamp (decode→present) + host render→capture term.
  • Apple stage-2 presenter (VTDecompressionSessionCAMetalLayer) — built + decode-unit-tested + live-validated on glass behind the punktfunk.presenter flag (capture→present ~11 ms p50); make it the default after a few resolution/HDR checks.
  • Mandatory PIN pairing + delegated pairing approval (an already-paired device approves a new one).
  • gamescope multi-user isolation — per-session input/audio so concurrent sessions are independent desktops (the shared-desktop multi-view case landed).
  • bazzite kept up to date (currently offline; one rebuild behind).