Scan API and Engine Integrations by gatesn · Pull Request #44 · vortex-data/rfcs

gatesn · 2026-04-08T14:28:50Z

This RFC looks at how we can expose deeper integration with query engine internals like scheduling, threading models, buffer pools, and so on

Signed-off-by: Nicholas Gates <nick@nickgates.com>

robert3005 · 2026-04-08T16:30:06Z

proposed/0034-scan-api.md

+
+- The Scan API is not itself a full relational query engine.
+- `LayoutReader` should not grow unknown-cardinality operator semantics.
+- Vortex should not require a specific Rust async runtime such as Tokio.


It's weird that we would go through all of this and still assume Tokio but I haven't read all of it yet

We already don't assume tokio, it just continues to be an explicit goal

The double negation here implies the opposite? You want the goal to be that the runtime doesn't assume tokio? maybe I am reading too much into random ai generated strings

robert3005 · 2026-04-08T16:53:31Z

proposed/0034-scan-api.md

+
+- the host may provide a CPU scheduler
+- Vortex may use it for bounded split-local CPU work
+- Vortex must not assume ownership of the whole query runtime


What does this statement mean in practice? I think there's intent behind it but I fail to understand what this means in practice?

I'd guess in particular in terms of use of resources, e.g. spawning threads but also unix process ownership e.g. Vortex should never crash the host. @gatesn correct me if you had sth else in mind.

Vortex should never crash the host.

Error handling might deserve a small section in this PR. I briefly talked about this with @myrrc but I think we'll need a panic handler (the host maybe can configure) to prevent that we never crash a host.

robert3005 · 2026-04-08T17:35:45Z

proposed/0034-scan-api.md

+- split lookahead policy
+- efficient materialization of output batches
+
+### What `Partitioning` Means


Words are hard, partitioning usually means some arrangement of data which this is not about. But maybe this is Partitioning and the other thing is Arrangament

robert3005 · 2026-04-08T18:02:52Z

proposed/0034-scan-api.md

+
+Correctness is more important than maximal pushdown.
+
+## Ordering, Limits, and Future Dynamic Filters


You should mention Partitioning here (or a I redefined it Arrangement). It's a super set of ordering

0ax1

Just a thought, maybe worthwhile clauding some ascii diagrams to illustrate some of the aspects.

Scan API

83863ca

Signed-off-by: Nicholas Gates <nick@nickgates.com>

robert3005 reviewed Apr 8, 2026

View reviewed changes

0ax1 reviewed Apr 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scan API and Engine Integrations#44

Scan API and Engine Integrations#44
gatesn wants to merge 1 commit intodevelopfrom
ngates/scan-api

gatesn commented Apr 8, 2026

Uh oh!

robert3005 Apr 8, 2026

Uh oh!

gatesn Apr 8, 2026

Uh oh!

robert3005 Apr 9, 2026

Uh oh!

robert3005 Apr 8, 2026

Uh oh!

0ax1 Apr 9, 2026

Uh oh!

0ax1 Apr 9, 2026

Uh oh!

robert3005 Apr 8, 2026

Uh oh!

robert3005 Apr 8, 2026

Uh oh!

0ax1 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		Correctness is more important than maximal pushdown.

		## Ordering, Limits, and Future Dynamic Filters

Conversation

gatesn commented Apr 8, 2026

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

0ax1 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants