-
Notifications
You must be signed in to change notification settings - Fork 2.8k
Flink: Dynamic Sink: Use max write parallelism if none is supplied #14191
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few minor comments.
static int firstPositive(int first, int second) { | ||
if (first > 0) { | ||
return first; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: new line.
} | ||
if (second > 0) { | ||
return second; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: new line
if (second > 0) { | ||
return second; | ||
} | ||
throw new IllegalArgumentException("None of the supplied ints were positive!"); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we print out both values to facilitate troubleshooting?
* @param rowData The data matching the provided schema. | ||
* @param partitionSpec The target table {@link PartitionSpec}. | ||
* @param distributionMode The {@link DistributionMode}. | ||
* @param writeParallelism The number of parallel writers. If set to {@literal <= 0}, will |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What if we set it to MAX_INT
. It is easier to calculate with min
later
This adds the option to automatically set the write parallelism to the max available write parallelism, i.e. the job parallelism.