Skip to content

Conversation

AdamGS
Copy link
Contributor

@AdamGS AdamGS commented Aug 2, 2025

This PR closes #14287, but is actually just a squash + rebase of #14411 by @Weijun-H., see the original PR for all the benchmarking results, rationale and thinking.

AFAICT there are no open discussions/comments there, but I would be happy to at least try and get this PR over the line as it seems like it offers some nice performance benefits.

@github-actions github-actions bot added documentation Improvements or additions to documentation physical-expr Changes to the physical-expr crates optimizer Optimizer rules core Core DataFusion crate sqllogictest SQL Logic Tests (.slt) common Related to common crate proto Related to proto crate physical-plan Changes to the physical-plan crate labels Aug 2, 2025
@alamb
Copy link
Contributor

alamb commented Aug 3, 2025

Thanks @AdamGS -- can you remind me what type of queries this should benefit? I am happy to start running some benchmarks / start reviewing some code

cc @crepererum

@AdamGS
Copy link
Contributor Author

AdamGS commented Aug 3, 2025

My understanding is that it should be beneficial in queries where the different partitions of the physical plan have heavy skew, allowing "lighter" partitions to pick up more work, which should both reduce the effect of these long-tail anomalies and reduce memory usage in cases where some producers are faster then their upstream consumer.

I'll work to fix the remaining test issues, seems like the plan changed a bit since the original code was written.

@AdamGS AdamGS force-pushed the on-demand-repartition-with-config branch from 574c138 to 9112ac9 Compare August 3, 2025 15:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
common Related to common crate core Core DataFusion crate documentation Improvements or additions to documentation optimizer Optimizer rules physical-expr Changes to the physical-expr crates physical-plan Changes to the physical-plan crate proto Related to proto crate sqllogictest SQL Logic Tests (.slt)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Alternative approaches to "fan-out" style RepartitionExec
3 participants