Coalesce data
Supported in: Batch, Faster
Reduces the number of partitions in a dataset. Coalescing does not trigger a shuffle: if you have 1000 partitions and coalesce to 100, each of the 100 new partitions claims 10 of the current partitions. If the requested number of partitions is larger than the current number, the dataset keeps its current number of partitions.
Transform categories: Other
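The partition-claiming behavior described above can be illustrated with a minimal pure-Python sketch. This is not the transform's implementation: the `coalesce` function, its parameter names, and the choice to merge contiguous runs of partitions are assumptions made for illustration (a real engine may group partitions differently, for example by data locality). It only shows that whole partitions are merged without redistributing individual rows, and that requesting more partitions than currently exist changes nothing.

```python
def coalesce(partitions, num_partitions):
    """Merge whole partitions into at most num_partitions groups.

    Hypothetical helper: partitions is a list of lists of rows.
    Rows are never redistributed individually, so there is no shuffle.
    """
    if num_partitions < 1:
        raise ValueError("num_partitions must be at least 1")

    current = len(partitions)

    # Requesting as many or more partitions than exist is a no-op.
    if num_partitions >= current:
        return [list(p) for p in partitions]

    merged = []
    for i in range(num_partitions):
        # Each new partition claims a contiguous run of existing partitions
        # (contiguity is an assumption of this sketch).
        start = i * current // num_partitions
        end = (i + 1) * current // num_partitions
        merged.append([row for p in partitions[start:end] for row in p])
    return merged


# 1000 single-row partitions, coalesced to 100 and to 5000.
partitions = [[i] for i in range(1000)]
coalesced = coalesce(partitions, 100)
unchanged = coalesce(partitions, 5000)
```

In this sketch, `coalesced` holds 100 partitions that each contain the rows of 10 original partitions, while `unchanged` still holds 1000 partitions because the requested count exceeds the current one.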
Declared arguments
- Dataset: Dataset to perform coalesce on.
  Table
- Number of partitions: Number of partitions to coalesce to.
  Literal<Integer>