feat: implement WebSocket streaming protocol for large payloads #2256#3346

Merged
iduartgomez merged 18 commits into main from feat/ws-message-streaming on Mar 10, 2026

Conversation

@netsirius
Collaborator

@netsirius netsirius commented Feb 28, 2026

Problem

The WebSocket interface between clients and the Freenet node used a single monolithic message frame for all payloads, requiring a 100 MB max_message_size limit to handle large WASM contract uploads. This caused excessive memory pressure on both client and server, since the entire payload had to be buffered in a single WebSocket frame before processing could begin.

Solution

Introduce a streaming protocol for WebSocket connections, using the chunking primitives from freenet-stdlib 0.2.0. Clients opt in via a ?streaming=true query parameter; non-streaming clients are completely unaffected.

How it works

All message types are chunked when large enough. The chunking decision is based purely on payload size — any serialized response or request that exceeds CHUNK_THRESHOLD (512 KiB) is automatically split into 256 KiB StreamChunk frames, regardless of the HostResponse or ClientRequest variant.
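The size-based decision above can be sketched as follows. This is a minimal, self-contained illustration using the constants quoted in the PR; `chunk_payload` is a hypothetical stand-in for freenet-stdlib's `chunk_response()`, not its actual signature.

```rust
// Constants as stated in the PR description.
const CHUNK_THRESHOLD: usize = 512 * 1024; // 512 KiB
const CHUNK_SIZE: usize = 256 * 1024; // 256 KiB

/// Hypothetical sketch: decide between a single frame and StreamChunk frames
/// purely on serialized payload size, regardless of message variant.
fn chunk_payload(payload: &[u8]) -> Vec<&[u8]> {
    if payload.len() <= CHUNK_THRESHOLD {
        // Small enough: a single WebSocket binary frame.
        return vec![payload];
    }
    // Otherwise split into fixed-size chunks to be wrapped as StreamChunk.
    payload.chunks(CHUNK_SIZE).collect()
}

fn main() {
    let small = vec![0u8; 100 * 1024];
    assert_eq!(chunk_payload(&small).len(), 1);

    let large = vec![0u8; 1024 * 1024]; // 1 MiB
    assert_eq!(chunk_payload(&large).len(), 4); // 4 × 256 KiB
}
```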

Server → Client (responses):
When streaming is enabled and a serialized response exceeds CHUNK_THRESHOLD, the server:

  1. Optionally sends a StreamHeader (Native encoding only) with stream_id, total_bytes, and content metadata so the client can pre-allocate or show progress. Currently only GetResponse includes this header.
  2. Splits the payload into StreamChunk messages via chunk_response() from freenet-stdlib.
  3. Sends each chunk as a separate WebSocket binary frame.
  4. Yields to the tokio runtime every MAX_CHUNKS_PER_BATCH chunks so pings and other control messages can flow through the select loop.
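The batching logic in step 4 can be checked without an async runtime by replacing the yield with a counter. A sketch, assuming `MAX_CHUNKS_PER_BATCH` is 64 as the commit message later states:

```rust
const MAX_CHUNKS_PER_BATCH: usize = 64;

/// Returns how many times the sender would yield to the runtime while
/// sending `n_chunks` chunks. In the real loop the marked line is
/// `tokio::task::yield_now().await`.
fn yield_points(n_chunks: usize) -> usize {
    let mut yields = 0;
    for sent in 1..=n_chunks {
        // ...send chunk `sent` as a WebSocket binary frame here...
        if sent % MAX_CHUNKS_PER_BATCH == 0 {
            yields += 1; // cooperative yield point
        }
    }
    yields
}

fn main() {
    assert_eq!(yield_points(10), 0); // short bursts never yield
    assert_eq!(yield_points(64), 1);
    assert_eq!(yield_points(200), 3); // after chunks 64, 128, 192
}
```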

Client → Server (requests):
freenet-stdlib 0.2.2+ automatically chunks large serialized requests (>512 KiB) into StreamChunk frames on the client side, regardless of the streaming query parameter. The server always reassembles them using ReassemblyBuffer before decoding the full ClientRequest.
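Server-side reassembly works roughly like the following sketch. The chunk shape (`stream_id`, `index`, `total`, bytes) and the `Reassembly` type are simplified stand-ins; the real `ReassemblyBuffer` in freenet-stdlib also enforces stream-count and chunk-count limits plus TTL eviction, which are omitted here.

```rust
use std::collections::HashMap;

/// Simplified per-connection reassembly buffer (hypothetical sketch).
#[derive(Default)]
struct Reassembly {
    streams: HashMap<u32, Vec<Option<Vec<u8>>>>,
}

impl Reassembly {
    /// Feed one chunk; returns the full payload once every index has arrived.
    fn receive_chunk(
        &mut self,
        stream_id: u32,
        index: usize,
        total: usize,
        bytes: Vec<u8>,
    ) -> Option<Vec<u8>> {
        let slots = self
            .streams
            .entry(stream_id)
            .or_insert_with(|| vec![None; total]);
        slots[index] = Some(bytes);
        if slots.iter().all(Option::is_some) {
            // Complete: drop the stream entry and concatenate in index order.
            let slots = self.streams.remove(&stream_id)?;
            Some(slots.into_iter().flatten().flatten().collect())
        } else {
            None
        }
    }
}

fn main() {
    let mut buf = Reassembly::default();
    // Chunks are ordered by index, not by arrival.
    assert!(buf.receive_chunk(7, 1, 2, b"world".to_vec()).is_none());
    let full = buf.receive_chunk(7, 0, 2, b"hello ".to_vec()).unwrap();
    assert_eq!(full, b"hello world");
}
```

Once `receive_chunk` returns the complete buffer, the server decodes it as an ordinary `ClientRequest`.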

Chunking vs StreamHeader — two layers

| Layer | Applies to | Purpose |
| --- | --- | --- |
| Chunking (`StreamChunk`) | All message types over `CHUNK_THRESHOLD` | Fragment and transparently reassemble any large payload |
| `StreamHeader` (optional metadata) | Only `GetResponse` + Native encoding | Pre-announce total size and content type for incremental client consumption |

The StreamHeader is an optimization that enables recv_stream() on the client — incremental chunk-by-chunk processing via an async Stream<Item = Bytes>. Without a header, the client's recv() transparently reassembles chunks and returns the complete HostResponse. Both paths deliver correct data; the header just enables richer client UX (progress bars, pre-allocation).

This two-layer design is intentional: GetResponse is the primary large-payload case (contract WASM up to 50 MiB + state up to 50 MiB) and the one where incremental consumption matters most. Other potentially large responses like UpdateNotification with full state are chunked and reassembled transparently. The StreamContent::Raw variant exists as a placeholder for future extension.
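The header decision itself is narrow, as described above: only `GetResponse` on a Native-encoded connection gets a `StreamHeader`. A hypothetical sketch (the enum and struct shapes here are illustrative, not the crate's real types):

```rust
#[derive(PartialEq, Eq, Clone, Copy)]
enum EncodingProtocol {
    Native,
    Flatbuffers,
}

// Illustrative stand-in for the HostResponse variants that matter here.
enum HostResponseKind {
    GetResponse,
    UpdateNotification,
    Other,
}

#[allow(dead_code)]
struct StreamHeader {
    stream_id: u32,
    total_bytes: u64,
}

/// Emit a StreamHeader only for GetResponse + Native; every other large
/// payload relies on transparent StreamChunk reassembly alone.
fn maybe_header(
    kind: &HostResponseKind,
    enc: EncodingProtocol,
    stream_id: u32,
    total_bytes: u64,
) -> Option<StreamHeader> {
    match (kind, enc) {
        (HostResponseKind::GetResponse, EncodingProtocol::Native) => {
            Some(StreamHeader { stream_id, total_bytes })
        }
        _ => None,
    }
}

fn main() {
    use EncodingProtocol::*;
    use HostResponseKind::*;
    assert!(maybe_header(&GetResponse, Native, 1, 1024).is_some());
    assert!(maybe_header(&GetResponse, Flatbuffers, 1, 1024).is_none());
    assert!(maybe_header(&UpdateNotification, Native, 1, 1024).is_none());
}
```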

Key design decisions

  • Opt-in streaming — backward compatible; existing clients work unchanged without the query parameter.
  • max_message_size kept at 100 MB — non-streaming clients still send large payloads (WASM contracts) as single frames. The ?streaming=true parameter controls chunked responses only; the client-side stdlib always chunks large requests automatically.
  • StreamHeader only for Native encoding — Flatbuffers clients (browser) use transparent reassembly via StreamChunk alone; the header is an optimization for bincode-based clients.
  • Cooperative yielding — after every MAX_CHUNKS_PER_BATCH chunks sent, the sender yields to the tokio runtime to prevent starving pings, subscriptions, and other control flow in the connection's select loop.
  • Wrapping stream IDs — per-connection u32 counter with wrapping_add; collisions would require 4B streams on a single connection.
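The wrapping counter is a one-liner; a minimal sketch of the behavior the PR's tests later verify (`u32::MAX` wraps to 0 without panicking):

```rust
/// Per-connection stream ID counter (sketch).
struct StreamIds {
    next: u32,
}

impl StreamIds {
    fn next_id(&mut self) -> u32 {
        let id = self.next;
        // wrapping_add avoids an overflow panic in debug builds; a collision
        // would require ~4 billion streams in flight on one connection.
        self.next = self.next.wrapping_add(1);
        id
    }
}

fn main() {
    let mut ids = StreamIds { next: u32::MAX };
    assert_eq!(ids.next_id(), u32::MAX);
    assert_eq!(ids.next_id(), 0); // wrapped, no panic
}
```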

Code changes

  • ConnectionState struct — consolidates per-connection state (encoding protocol, streaming flag, reassembly buffer, stream ID counter) instead of passing individual parameters.
  • send_response_message() — new helper that abstracts the "send as single frame or chunk" decision, used by all four response paths: host responses, subscription notifications, auth errors, and node-unavailable errors.
  • decode_client_request() — extracted as an inner function to avoid duplicating deserialization logic between normal requests and post-reassembly decoded requests.
  • EncodingProtocol now derives PartialEq, Eq — needed for conditional StreamHeader logic.

Bonus bugfix

The NodeUnavailable error path in process_host_response previously used bincode::serialize unconditionally, which would send malformed data to Flatbuffers clients. Now correctly serializes per the connection's encoding protocol.
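The shape of the fix is a dispatch on the connection's encoding rather than an unconditional bincode call. In this sketch the byte payloads are stand-ins for the real bincode and flatbuffers output, purely to show the branch structure:

```rust
#[derive(PartialEq, Eq, Clone, Copy)]
enum EncodingProtocol {
    Native,
    Flatbuffers,
}

/// Serialize the NodeUnavailable error per the connection's encoding.
/// Before the fix, the Native (bincode) arm effectively ran for every client.
fn serialize_node_unavailable(enc: EncodingProtocol) -> Vec<u8> {
    match enc {
        // stand-in for bincode::serialize(&response)
        EncodingProtocol::Native => b"bincode:NodeUnavailable".to_vec(),
        // stand-in for the flatbuffers builder path
        EncodingProtocol::Flatbuffers => b"fbs:NodeUnavailable".to_vec(),
    }
}

fn main() {
    // The two encodings must produce different, decodable-by-their-client bytes.
    assert_ne!(
        serialize_node_unavailable(EncodingProtocol::Native),
        serialize_node_unavailable(EncodingProtocol::Flatbuffers)
    );
}
```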

Testing

  • Streaming protocol unit tests (chunking, reassembly, edge cases) are in freenet-stdlib (freenet-stdlib#58).
  • cargo test -p freenet passes (once freenet-stdlib 0.2.0 is published).

Dependencies

This PR must be merged after freenet-stdlib#58, which adds the streaming types (StreamChunk, StreamHeader, ReassemblyBuffer, chunk_response, etc.) in freenet-stdlib 0.2.0. CI will fail until that crate is published.

Files changed

| File | Change |
| --- | --- |
| `Cargo.toml` | Bump freenet-stdlib to 0.2.0 |
| `Cargo.lock`, `apps/freenet-ping/Cargo.lock` | Lockfile updates |
| `crates/core/src/client_events/websocket.rs` | Streaming protocol integration, `ConnectionState`, reassembly, chunked send across all response paths |
| `crates/core/src/util/mod.rs` | Add `PartialEq`, `Eq` to `EncodingProtocol` |

Fixes

Closes #2256

@github-actions
Contributor

github-actions bot commented Feb 28, 2026

🔍 Rule Review: WebSocket streaming protocol (PR #3346)

Rules checked: git-workflow.md, code-style.md, testing.md
Files reviewed: 7 (apps/freenet-ping/Cargo.{lock,toml}, apps/freenet-ping/app/tests/common/mod.rs, apps/freenet-ping/app/tests/run_app.rs, apps/freenet-ping/types/src/lib.rs, crates/core/src/client_events/websocket.rs, crates/core/src/util.rs)

Critical

No critical findings.

Warnings

No warning-level findings.

Info

  • crates/core/src/client_events/websocket.rs:691 — extract_stream_content uses a catch-all _ => None arm over HostResponse variants. If a future variant should produce StreamContent metadata, it will silently return None with no compiler prompt. (rule: code-style.md — "Catch-all _ => in exhaustive matches")

Notes (not rule violations):

  • send_response_message yields via tokio::task::yield_now() every MAX_CHUNKS_PER_BATCH chunks. The inline comment says this lets "pings, subscriptions, and other control flow in the connection's select loop" proceed, but yield_now only yields to other tasks in the executor, not to other arms of the same select!. The ping arm of the surrounding loop cannot run until send_response_message returns. The yield is still useful (it helps other spawned tasks) but the comment overstates what it accomplishes. Worth a wording fix, not a blocking issue.

  • rand::rngs::StdRng::seed_from_u64(...) and tokio::time::sleep in the new integration test at apps/freenet-ping/app/tests/run_app.rs:168,310 are not in crates/core/, so the DST/GlobalRng/TimeSource rules do not apply there. ✓

  • ConnectionState::reassembly (ReassemblyBuffer) is bounded: MAX_CONCURRENT_STREAMS and MAX_TOTAL_CHUNKS are enforced and tested in the new unit tests. ✓

  • No tests were deleted or commented out; no bare .unwrap() in new production paths; no fire-and-forget spawns; no `biased;` added. ✓

To acknowledge findings: Check each box above once addressed (or a tracking issue opened),
or post /ack to dismiss all findings for this revision.
Merge is not blocked — the single INFO item is advisory only.


Automated review against .claude/rules/. Critical and Warning findings block merge — check boxes or post /ack to acknowledge.

@iduartgomez

This comment was marked as outdated.

@netsirius netsirius force-pushed the feat/ws-message-streaming branch from 79c11a4 to b0db600 on March 2, 2026 at 13:07
@netsirius

This comment was marked as outdated.

@github-actions

This comment was marked as outdated.

@iduartgomez iduartgomez force-pushed the feat/ws-message-streaming branch from a19fd85 to 8dc5d40 on March 10, 2026 at 13:09
The rebase on main introduced a duplicate StreamChunk reassembly block
(from main's a085374) alongside the PR's existing conn_state-based
reassembly. Remove the duplicate and its unused reassembly_buffer variable.
ReassemblyBuffer in freenet-stdlib 0.2.2 already evicts incomplete
streams after STREAM_TTL (60s) via evict_stale() on every
receive_chunk() call. Update the comment to reflect this.

Addresses rule-review warning about unbounded cleanup exemptions.
iduartgomez

This comment was marked as resolved.

freenet-stdlib 0.2.2+ automatically chunks large ClientRequest messages
(>512 KiB) on the client side regardless of whether streaming=true was
passed in the WebSocket URL. The server was only reassembling StreamChunk
when conn_state.streaming was true, causing non-streaming clients' large
PUT requests to pass through as unrecognized StreamChunk variants and
never get processed — resulting in 60s timeouts.

The streaming flag should only control whether the server sends chunked
responses, not whether it can receive chunked requests.
- Reassembly errors now send an error response to the client instead of
  killing the connection. A single malformed chunk (duplicate, out-of-range)
  no longer terminates the entire WebSocket session.

- Restore cooperative yielding: send_response_message now yields to the
  tokio runtime every MAX_CHUNKS_PER_BATCH (64) chunks, preventing
  starvation of pings, subscriptions, and other select arms.

- Avoid materializing all chunks in memory: send_response_message now
  serializes and sends each chunk individually instead of collecting
  all into a Vec<Vec<u8>> first, reducing peak memory by ~50% for
  large payloads.

- Extract duplicated stream_content matching into
  extract_stream_content() helper (was copy-pasted in 2 places).
New tests covering review gaps:
- NodeUnavailable encoding: verify Native path produces valid bincode
  and FBS path produces non-bincode bytes (encoding dispatch bugfix)
- Non-streaming StreamChunk reassembly: verify streaming=false still
  reassembles chunked requests (freenet-stdlib always chunks large reqs)
- FBS chunked validation: verify FBS chunks are non-empty, have overhead,
  and are not accidentally bincode-encoded
- stream_id wrapping: verify u32::MAX wraps to 0
- extract_stream_content: verify GetResponse produces Some, others None
@iduartgomez iduartgomez enabled auto-merge March 10, 2026 15:47
@iduartgomez iduartgomez added this pull request to the merge queue Mar 10, 2026
Merged via the queue into main with commit 65f2bf3 Mar 10, 2026
10 checks passed
@iduartgomez iduartgomez deleted the feat/ws-message-streaming branch March 10, 2026 16:15


Development

Successfully merging this pull request may close these issues.

Implement WebSocket message streaming for large payloads