-
Notifications
You must be signed in to change notification settings - Fork 1k
Vectorized hash grouping #7316
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vectorized hash grouping #7316
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #7316 +/- ##
==========================================
+ Coverage 80.06% 81.90% +1.83%
==========================================
Files 190 245 +55
Lines 37181 45285 +8104
Branches 9450 11315 +1865
==========================================
+ Hits 29770 37089 +7319
- Misses 2997 3730 +733
- Partials 4414 4466 +52 ☔ View full report in Codecov by Sentry. |
This commit has various assorted refactorings and cosmetic changes: * Various cosmetic things I don't know where to put. * The definitions of aggregate functions and grouping columns in the vector agg node are now typed arrays and not lists. * The aggegate function implementation always work with at most one filter bitmap. This reduces the amount of code and will help to support the aggregate FILTER clauses. * Parts of the aggregate function implementations are restructured and renamed in a way that will make it easier to support hash grouping. * EXPLAIN output is added for vector agg node that mentions the grouping policy that is being used.
We didn't properly resolve INDEX_VARs in the output targetlist of DecompressChunk nodes, which are present when it uses a custom scan targetlist. Fix this by always working with the targetlist where these variables are resolved to uncompressed chunk variables, like we do during execution.
Co-authored-by: Erik Nordström <[email protected]> Signed-off-by: Alexander Kuzmenkov <[email protected]>
The continuous aggregate incremental refresh test accidentally used the now() function which makes it fail. Replace it with fixed dates.
This case was handled incorrectly and led to a segfault when grouping by multiple columns, one of which is a UUID segmentby column.
|
This pull request has been automatically marked as stale due to lack of activity. This pull request will be closed in 30 days. |
|
This was split out into other prs |
some experiments
Parts: