Filter(筛选(Filter))¶
Supported in: Batch, Faster, Streaming
Filters the input dataset based on the specified filter condition.
Transform categories: Data preparation, Popular
Declared arguments¶
- Dataset: Dataset to filter.
Table - Filter condition: Condition to filter on. Values that return true are kept, others are removed.
Expression\
Examples¶
Example 1: Base case¶
Argument values:
- Dataset: ri.foundry.main.dataset.a
- Filter condition:
recently_serviced
Input:
| recently_serviced | tail_number |
|---|---|
| true | KK-150 |
| false | XB-120 |
| true | MT-190 |
Output:
| recently_serviced | tail_number |
|---|---|
| true | KK-150 |
| true | MT-190 |
Example 2: Base case¶
Description: Nulls are treated as false
Argument values:
- Dataset: ri.foundry.main.dataset.a
- Filter condition:
recently_serviced
Input:
| recently_serviced | tail_number |
|---|---|
| null | KK-150 |
| true | XB-120 |
Output:
| recently_serviced | tail_number |
|---|---|
| true | XB-120 |
中文翻译¶
筛选(Filter)¶
支持模式:批处理(Batch)、快速处理(Faster)、流处理(Streaming)
根据指定的筛选条件对输入数据集进行过滤。
转换类别:数据准备、常用
声明参数¶
- 数据集(Dataset): 需要筛选的数据集。
表格 - 筛选条件(Filter condition): 用于筛选的条件。返回真值的记录将被保留,其余记录将被移除。
表达式\<布尔值>
示例¶
示例1:基础案例¶
参数值:
- 数据集: ri.foundry.main.dataset.a
- 筛选条件:
recently_serviced
输入:
| recently_serviced | tail_number |
|---|---|
| true | KK-150 |
| false | XB-120 |
| true | MT-190 |
输出:
| recently_serviced | tail_number |
|---|---|
| true | KK-150 |
| true | MT-190 |
示例2:基础案例¶
说明: 空值(Null)将被视为假值(false)
参数值:
- 数据集: ri.foundry.main.dataset.a
- 筛选条件:
recently_serviced
输入:
| recently_serviced | tail_number |
|---|---|
| null | KK-150 |
| true | XB-120 |
输出:
| recently_serviced | tail_number |
|---|---|
| true | XB-120 |