Repartition data

Supported in: Batch, Faster

Forces a shuffle of the data based on optionally provided partitioning columns and a target number of partitions. If these are not provided, the partitioning is determined automatically.

Transform categories: Other

Declared arguments

Dataset: Dataset to repartition.
Table
optional Incremental partition count: Number of partitions to reshuffle to if the build is incrementally updated.
Literal\<Integer>
optional Number of partitions: Number of partitions to reshuffle to.
Literal\<Integer>
optional Partitioning columns: Specifies the list of columns to be used for repartitioning.
List\<Column\<AnyType>>
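
The effect of the two main arguments can be illustrated with a small, self-contained Python sketch. This is only a conceptual model, not the transform's actual implementation: the function name `repartition`, the `default_partitions` fallback, the CRC32-based hash, and the round-robin behaviour when no columns are given are all assumptions made for illustration.

```python
import zlib


def repartition(rows, num_partitions=None, partitioning_columns=None,
                default_partitions=4):
    """Redistribute rows (dicts) across partitions.

    Hypothetical model: `default_partitions` stands in for the
    "determined automatically" case described above.
    """
    n = num_partitions if num_partitions is not None else default_partitions
    if n < 1:
        raise ValueError("number of partitions must be at least 1")

    partitions = [[] for _ in range(n)]
    for index, row in enumerate(rows):
        if partitioning_columns:
            # Hash the values of the partitioning columns so that rows
            # sharing the same key always land in the same partition.
            key = repr(tuple(row[c] for c in partitioning_columns))
            target = zlib.crc32(key.encode("utf-8")) % n
        else:
            # Assumption: with no columns, spread rows evenly (round-robin).
            target = index % n
        partitions[target].append(row)
    return partitions


rows = [
    {"country": "DE", "amount": 10},
    {"country": "FR", "amount": 20},
    {"country": "DE", "amount": 30},
    {"country": "US", "amount": 40},
    {"country": "FR", "amount": 50},
]

by_country = repartition(rows, num_partitions=3, partitioning_columns=["country"])
round_robin = repartition(rows, num_partitions=2)
```

In this sketch, `by_country` always contains three partitions (some may be empty), and every row with a given `country` value sits in the same partition. `round_robin` ignores the row contents and only balances partition sizes.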
