
Extract rows from shapefile

Supported in: Batch

Reads a dataset of files and parses each shapefile into rows. Only .shp, .shx and .dbf files are read; all other files are ignored. The parser supports only the point, polyline, polygon and multipoint geometry types. The output dataset has a geometry column and a column for each property listed by the user, in addition to the _error and _file columns. If the user provides no properties to extract, the entire properties struct is extracted into a single properties column as a string. UTF-8 is the only supported encoding for property names and values: a .cpg file that specifies a different encoding is ignored.
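As an illustration of the geometry-type restriction, the sketch below reads the shape type code from the 100-byte header of a .shp file and checks it against the four supported types. It is not the transform's implementation. The helper names are hypothetical, the byte offsets and type codes follow the ESRI shapefile specification, and treating only the plain 2D codes (1, 3, 5, 8) as supported is an assumption, since the text above does not say how Z or M variants are handled.

```python
import struct

# Shape type codes from the ESRI shapefile specification.
# Assumption: only the plain 2D variants count as supported.
SUPPORTED_SHAPE_TYPES = {1: "point", 3: "polyline", 5: "polygon", 8: "multipoint"}


def shape_type_from_header(header: bytes) -> int:
    """Return the shape type code stored in a .shp main file header."""
    if len(header) < 100:
        raise ValueError("a .shp main file header is 100 bytes long")
    # Bytes 0-3: file code, big-endian, always 9994.
    (file_code,) = struct.unpack(">i", header[0:4])
    if file_code != 9994:
        raise ValueError("unexpected file code; not a .shp file")
    # Bytes 32-35: shape type, little-endian.
    (shape_type,) = struct.unpack("<i", header[32:36])
    return shape_type


def is_supported(header: bytes) -> bool:
    """True if the header declares one of the four supported geometry types."""
    return shape_type_from_header(header) in SUPPORTED_SHAPE_TYPES
```

In practice the header would come from the first 100 bytes of the file, for example `open(path, "rb").read(100)`.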

Transform categories: File, Geospatial

Declared arguments

  • Dataset: Dataset of shapefiles to process. Each shapefile must have a .shp, a .shx and a .dbf file, and all files of a shapefile must share the same name. For example, a dataset with these files has two shapefiles (shapefile1 and shapefile2): shapefile1.shp, shapefile1.shx, shapefile1.dbf, folder/shapefile2.shp, folder/shapefile2.shx, folder/shapefile2.dbf.
    Files
  • optional List of properties: List of properties and their types that need to be extracted from these shapefiles. If no value or an empty struct is provided, extracts all properties in one 'properties' column as a string.
    Type
  • optional Source coordinate system: Coordinate system identifier formatted as "authority:id". For example, UTM zone 18N is identified by EPSG:32618. If not specified, defaults to WGS84 (EPSG:4326). Input geometries are converted from this coordinate system to EPSG:4326 to standardize downstream transformations.
    Literal
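
The argument rules above can be sketched in a few lines of Python. This is a minimal illustration, not the transform's implementation: `group_shapefiles` and `parse_crs` are hypothetical helper names, matching file extensions exactly in lowercase is an assumption, and so is keying each shapefile by its full path without the extension.

```python
from collections import defaultdict
from pathlib import PurePosixPath

REQUIRED_EXTENSIONS = {".shp", ".shx", ".dbf"}


def group_shapefiles(paths):
    """Group dataset file paths into shapefiles, keeping only complete sets."""
    groups = defaultdict(dict)
    for raw in paths:
        path = PurePosixPath(raw)
        # Assumption: extensions are matched exactly, in lowercase.
        if path.suffix in REQUIRED_EXTENSIONS:  # every other file is ignored
            groups[str(path.with_suffix(""))][path.suffix] = raw
    # A shapefile needs all three files under the same name.
    return {
        name: files
        for name, files in groups.items()
        if set(files) == REQUIRED_EXTENSIONS
    }


def parse_crs(identifier=None):
    """Split an "authority:id" identifier, defaulting to EPSG:4326 (WGS84)."""
    if not identifier:
        return ("EPSG", "4326")
    authority, separator, code = identifier.partition(":")
    if not separator or not authority or not code:
        raise ValueError('expected an identifier formatted as "authority:id"')
    return (authority, code)
```

With the file list from the Dataset example, `group_shapefiles` returns two entries, keyed `shapefile1` and `folder/shapefile2`, and a lone `.shp` file without its companions is dropped.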
