Python transforms with Spark
Python transforms can be configured with either single-node or multi-node compute engines. For general introductory material on working with Python transforms across compute engines, including Spark, see Python transforms.
The pages listed below document Python transforms with Spark, including Spark-specific guides and references for working with distributed compute.
- Incremental pipelines with Spark
- Accelerating Spark with Velox
- Reading files in a repository
- How to output column metadata
- API reference for Python transforms (Spark)
- PySpark reference guide