Streaming resource guide（流式资源指南）¶

This page lists the resources you may need to reference when implementing an end-to-end streaming workflow.

Data Connection supports syncing data from a wide variety of streaming platforms into Foundry streaming datasets, which can then be used in streaming pipelines. Streaming syncs enable data to flow into Foundry with low latency and high throughput to support real time decision-making processes.

There are two ways to sync data from streams into Foundry:

Data Connection supports pulling records from streaming platforms into Foundry. As with batch syncs, data is read from a stream and synced to Foundry using only unidirectional connections using the agent architecture.
If desired, Foundry enables pushing records from a stream directly into a Foundry stream via the stream proxy.

Foundry can connect to many sources of streaming data. Sources with dedicated connectors include:

Apache Kafka
Amazon Kinesis
Amazon SQS
Aveva PI
Google Pub/Sub

For streaming sources without a dedicated connector, you can connect to them using external transforms. This includes sources such as:

ActiveMQ
Amazon SNS
IBM MQ
RabbitMQ
MQTT [Beta]
Solace

This page lists the resources you may need to reference when implementing an end-to-end streaming workflow.

1. Core concepts¶

We recommend reviewing the following introductory concept page to understand what streams are, how they are stored, and how they are processed.

2. Overview¶

These pages will offer a broader scope of the various points to consider when determining if streaming is right for your use case or when deploying production streams.

3. Connect to data sources¶

You will need to complete one of the following workflows to connect your external data sources to Foundry for streaming. We recommend reviewing both options to understand possible benefits and limitations for your use case.

4. Transform your streaming data¶

You can use Pipeline Builder to transform your live data. Outputs of your Pipeline Builder transforms will still be streaming datasets that you can use in real time throughout Foundry.

5. Monitor streaming pipelines [Beta]¶

Set up alerting around your pipeline's health.

Stream monitoring

6. Development tools¶

Here, you can find tools to improve development of streaming pipelines.

Reset stream

中文翻译¶

流式资源指南¶

本页面列出了在实施端到端流式工作流时可能需要参考的资源。

Data Connection 支持将来自多种流式平台的数据同步到 Foundry 流式数据集(streaming datasets)中，这些数据集随后可用于流式管道。流式同步(Streaming syncs)能够以低延迟和高吞吐量将数据流入 Foundry，从而支持实时决策过程。

有两种方式可将数据从流同步到 Foundry：

Data Connection 支持从流式平台拉取记录到 Foundry。与批处理同步(batch syncs)类似，数据从流中读取并通过代理架构仅使用单向连接同步到 Foundry。
如有需要，Foundry 支持通过流代理(stream proxy)将记录直接从流推送到 Foundry 流中。

Foundry 可以连接到多种流式数据源。具有专用连接器的数据源包括：