
[client] Refactor RemoteLogDownloader to use chunked file append instead of downloading whole log file #3263

Open
swuferhong wants to merge 1 commit into apache:main from swuferhong:remote-log-download-slice

Conversation

@swuferhong
Contributor

Purpose

Linked issue: close #3262

Refactor RemoteLogDownloader from whole-file download to chunked streaming I/O. This significantly reduces time-to-first-byte latency, saves bandwidth when consumers stop mid-segment, and adds fine-grained flow control at the chunk level.
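The idea can be sketched as follows (a minimal illustration of chunked append, not the actual Fluss code; all names here are made up):

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Illustrative sketch only: consume a "remote" stream in fixed-size chunks,
// appending each chunk to the local sink as it arrives instead of waiting
// for the whole file to finish downloading.
public class ChunkedAppendSketch {

    // Reads up to chunkSize bytes per iteration and appends them to the sink.
    // Returns the number of chunks written.
    static int downloadInChunks(InputStream remote, ByteArrayOutputStream localSink, int chunkSize)
            throws IOException {
        byte[] buf = new byte[chunkSize];
        int chunks = 0;
        int n;
        while ((n = remote.read(buf)) != -1) {
            localSink.write(buf, 0, n); // this chunk is usable by the consumer immediately
            chunks++;
        }
        return chunks;
    }

    public static void main(String[] args) throws IOException {
        byte[] segment = new byte[20];
        ByteArrayOutputStream local = new ByteArrayOutputStream();
        int chunks = downloadInChunks(new ByteArrayInputStream(segment), local, 8);
        System.out.println(chunks); // 20 bytes with 8-byte chunks -> 3 chunks
    }
}
```

Because the first chunk is available as soon as it is read, time-to-first-byte no longer depends on the total segment size, and a consumer that stops mid-segment never pays for the remaining chunks.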

Brief change log

Tests

API and Format

Documentation

Member

@fresh-borzoni fresh-borzoni left a comment


@swuferhong Thanks for the PR, that's a big improvement!

Now I want to backport it to the Rust part as well, so I left some comments.
I hope you find them useful, PTAL

/**
* Tests chunked download with a small chunk size, verifying that: 1. A large segment is split
* into multiple chunks. 2. Chunks can be consumed while subsequent chunks are still being
* downloaded (边读边下载). 3. Flow control (maxPrefetchChunks) pauses downloading when unconsumed


nit: mix of languages here

* Request to read a remote log segment in chunks starting from the given position. This method
* is non-blocking and returns a future for the first chunk.
*/
public RemoteLogDownloadFuture requestRemoteLog(


I think we introduced a race:
requestRemoteLog() adds the request to segmentsToFetch and returns. The caller then installs the next-chunk callback on the returned future. But the download thread is already running, so it can poll the request, read chunk 1, and call tryScheduleNextChunk(), which builds chunk 2's future by copying request.downloadFuture.getNextChunkCallback(), still null at that moment. Chunk 2 is read and completed, but nothing is listening, so the bucket silently stops at chunk 1.
Probably we can install the callback before publishing the request.
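The suggested fix could look roughly like this (a hypothetical sketch; Request, segmentsToFetch, and the callback field only approximate the PR's types):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.function.Consumer;

// Illustrative sketch of the suggested fix: fully initialize the request,
// including its next-chunk callback, BEFORE publishing it to the queue the
// download thread polls, so the callback can never be observed as null.
public class PublishOrderSketch {

    static class Request {
        final CompletableFuture<byte[]> firstChunk = new CompletableFuture<>();
        volatile Consumer<byte[]> nextChunkCallback; // installed before publish
    }

    final ConcurrentLinkedQueue<Request> segmentsToFetch = new ConcurrentLinkedQueue<>();

    Request requestRemoteLog(Consumer<byte[]> onNextChunk) {
        Request request = new Request();
        // 1. Install the callback first ...
        request.nextChunkCallback = onNextChunk;
        // 2. ... then publish; the download thread may poll the request
        //    immediately after this line, but the callback is already visible.
        segmentsToFetch.add(request);
        return request;
    }
}
```

The key property is the happens-before edge: the add to the concurrent queue publishes the fully constructed request, so any thread that polls it sees the callback.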


private final CompletableFuture<LogRecords> chunkFuture;
private final Runnable recycleCallback;
private Consumer<RemoteLogDownloadFuture> nextChunkCallback;


volatile?

}

@Override
public void close() throws IOException {


A request that's been polled and partially read but is paused is in neither segmentsToFetch nor continuationQueue.

close() only walks those two queues, so the open FSDataInputStream and the remote-chunk tmp file are left behind.
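One possible shape of the fix, sketched with hypothetical names (the real RemoteLogDownloader holds different types; the point is only that the in-flight request needs its own slot that close() also drains):

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.atomic.AtomicReference;

// Illustrative sketch: besides draining the two queues, close() also releases
// the request currently held by the download thread, which lives in neither
// queue while it is polled, partially read, or paused.
public class CloseAllSketch implements Closeable {

    final ConcurrentLinkedQueue<Closeable> segmentsToFetch = new ConcurrentLinkedQueue<>();
    final ConcurrentLinkedQueue<Closeable> continuationQueue = new ConcurrentLinkedQueue<>();
    // The partially-read, paused request is tracked here by the download thread.
    final AtomicReference<Closeable> inFlight = new AtomicReference<>();

    @Override
    public void close() throws IOException {
        for (Closeable c; (c = segmentsToFetch.poll()) != null; ) c.close();
        for (Closeable c; (c = continuationQueue.poll()) != null; ) c.close();
        Closeable current = inFlight.getAndSet(null);
        if (current != null) {
            current.close(); // closes the open input stream / removes the tmp file
        }
    }
}
```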

+ "A larger chunk size reduces the number of remote I/O requests but "
+ "increases memory usage per chunk read. The default setting is 8MB.");

public static final ConfigOption<Integer> CLIENT_SCANNER_REMOTE_LOG_MAX_PREFETCH_CHUNKS =


I wonder what will happen if it's 0?
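If a prefetch budget of 0 would mean no chunk can ever be scheduled, a minimal up-front guard could look like this (the semantics and the lower bound are assumptions, not taken from the PR):

```java
// Illustrative guard: reject a prefetch budget that could stall the download
// loop before it starts. Whether 0 should be rejected or mean "no prefetch"
// is exactly the question the reviewer raises.
public class PrefetchConfigSketch {
    static int validateMaxPrefetchChunks(int value) {
        if (value < 1) {
            throw new IllegalArgumentException(
                    "max prefetch chunks must be >= 1, got: " + value);
        }
        return value;
    }
}
```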

long startTime = System.currentTimeMillis();
// download the remote file to local
remoteFileDownloader
.downloadFileAsync(fsPathAndFileName, localLogDir)


The old fetchOnce dispatched the download to RemoteFileDownloader's thread pool (default 3) and returned immediately, so multiple segments could be downloading at once. The new processChunkRead does the fs.open + read inline on the dispatcher, so only one chunk is ever in flight.

this looks like it serializes remote reads; is it intentional, or am I missing something?
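The old behavior the comment describes could be restored along these lines (illustrative only; the pool size of 3 mirrors the stated RemoteFileDownloader default, everything else is made up):

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

// Illustrative sketch of the reviewer's point: dispatch each chunk read to a
// bounded pool, as the old whole-file path did, so several segments can be in
// flight at once instead of reading inline on the single dispatcher thread.
public class ParallelChunkReadSketch {
    static final ExecutorService DOWNLOAD_POOL = Executors.newFixedThreadPool(3);

    static CompletableFuture<Integer> readChunkAsync(int chunkIndex) {
        // The real code would fs.open() and read the chunk here; this sketch
        // just returns the index to show the dispatch pattern.
        return CompletableFuture.supplyAsync(() -> chunkIndex, DOWNLOAD_POOL);
    }
}
```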

// If toCompletedFetch() fails (e.g. the underlying chunk
// future completed exceptionally), discard this entry so
// the queue is not blocked. The bucket will become fetchable
// again and the server can re-issue a remote fetch.

@fresh-borzoni fresh-borzoni May 13, 2026


nit: I'm not sure: what if the issue is permanent? It would be an infinite loop with only a warning. Should we have retry with backoff and then surface an error in the scanner?
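The suggested retry-with-backoff could be sketched like this (hypothetical helper; the attempt count and backoff values are placeholders):

```java
import java.util.function.Supplier;

// Illustrative sketch: retry a transient failure with exponential backoff,
// then rethrow so the error surfaces to the caller (e.g. the scanner)
// instead of looping forever with only a warning.
public class RetryBackoffSketch {
    static <T> T retryWithBackoff(Supplier<T> op, int maxAttempts, long baseBackoffMs)
            throws InterruptedException {
        RuntimeException last = null;
        for (int attempt = 0; attempt < maxAttempts; attempt++) {
            try {
                return op.get();
            } catch (RuntimeException e) {
                last = e;
                Thread.sleep(baseBackoffMs << attempt); // 1x, 2x, 4x, ...
            }
        }
        throw last; // permanent failure: surface it instead of retrying forever
    }
}
```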


TableBucket tb = new TableBucket(DATA1_TABLE_ID, 0);
// Build a large segment with multiple records so multiple chunks are produced.
List<RemoteLogSegment> remoteLogSegments =


It doesn't look big, or at least not bigger than the 8MB chunk size:

public static final List<Object[]> DATA1 =
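A test-data builder that guarantees the segment spans multiple chunks might look like this (a sketch with made-up names; the byte budget would need to exceed the configured chunk size, e.g. more than 8MB):

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch: build enough records to exceed a target byte budget,
// so a "multiple chunks" test actually produces multiple chunks rather than
// fitting inside the first one.
public class LargeSegmentDataSketch {
    static List<byte[]> buildRecords(long targetBytes, int recordSize) {
        List<byte[]> records = new ArrayList<>();
        long total = 0;
        while (total < targetBytes) {
            records.add(new byte[recordSize]);
            total += recordSize;
        }
        return records;
    }
}
```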

Development

Successfully merging this pull request may close these issues.

[client] Refactor RemoteLogDownloader to use chunked streaming instead of downloading whole segment file
