
ws_dataset: add parallel_collect option to SQL query execution#59

Open
tig888 wants to merge 1 commit into main from tig/parallel-collect-sql-queries


tig888 commented Apr 9, 2026

Adds a `parallel_collect` parameter to `_parse_sql_queries_polars` and `_execute_sql_queries_polars` that uses `pl.collect_all()` to collect all lazy frames in parallel before concatenation.
tig888 requested a review from jpc on April 9, 2026, 18:07

jpc commented May 7, 2026

This is amazing, but I was wondering if we should just always do it, with no flag? You mentioned memory usage, but did you confirm it's actually worse?
