Getting started(入门指南)¶
:::callout{theme="warning"} Preparation has been superseded by Pipeline Builder and is therefore no longer the recommended approach for cleaning and preparing data. Pipeline Builder makes it easy to clean and prepare your data for pipelines, while also offering Marketplace support. :::
This page will help introduce you to the Preparation interface for cleaning and preparing datasets.
Open a dataset for cleaning and preparation¶
From a dataset¶
Choose Clean in Preparation from the Actions menu on any dataset to create a new preparation for that dataset.
From the Preparation screen¶
Click the Select a dataset... button, and select the dataset to clean/prepare.
Clean and prepare your data¶
- Click on a column to see an overview of the data in that column, and apply cleaning and preparation actions.
- There are two preparation views: Table (which looks like a spreadsheet) and Column (which shows a more compact card for each column).
- View the basic examples page for various ways to clean and prepare data.
Analyze in Contour while cleaning/preparing¶
- Click the Analyze button to open the current preparation in Contour.
- As you make changes to the preparation, Contour will prompt to update. Click the Update data button in Contour to refresh the analysis based on the preparation.
Save a cleaned or prepared copy of a dataset¶
- Click the Save as dataset button in the header bar.
- By default, this action will create an updating dataset that can be rebuilt based on changes to the underlying data set or changes to the preparation. To save off a one-off dataset, click the arrow beside the Save as dataset button and choose Save one-off dataset.
- Select where you want the dataset to be saved, then click Save. The dataset will begin building, and you will be notified when it is ready.

中文翻译¶
入门指南¶
:::callout{theme="warning"} Preparation 已被 Pipeline Builder 取代,因此不再推荐用于数据清洗和准备。Pipeline Builder 可轻松为管道清洗和准备数据,同时还提供 Marketplace 支持。 :::
本页将帮助您了解用于清洗和准备数据集的 Preparation 界面。
打开数据集进行清洗和准备¶
从数据集开始¶
在任何数据集的 Actions 菜单中选择 Clean in Preparation,即可为该数据集创建一个新的 preparation。
从 Preparation 界面开始¶
点击 Select a dataset... 按钮,选择要清洗/准备的数据集。
清洗和准备数据¶
- 点击某一列,查看该列数据的概览,并应用清洗和准备操作。
- 有两种 preparation 视图:Table(类似电子表格)和 Column(每列显示更紧凑的卡片)。
- 查看 基本示例 页面,了解各种清洗和准备数据的方法。
在清洗/准备时通过 Contour 进行分析¶
- 点击 Analyze 按钮,在 Contour 中打开当前的 preparation。
- 当您对 preparation 进行更改时,Contour 会提示更新。点击 Contour 中的 Update data 按钮,即可基于 preparation 刷新分析结果。
保存数据集的清洗或准备副本¶
- 点击标题栏中的 Save as dataset 按钮。
- 默认情况下,此操作将创建一个可更新的数据集,该数据集可根据底层数据集的变化或 preparation 的更改进行重建。如需保存一次性数据集,请点击 Save as dataset 按钮旁的箭头,然后选择 Save one-off dataset。
- 选择数据集的保存位置,然后点击 Save。数据集将开始构建,并在准备就绪时通知您。
