The Cloudflare Blog: Just landed: streaming ingestion on Cloudflare with Arroyo and Pipelines

Apr 10, 2025

—

Source URL: https://blog.cloudflare.com/cloudflare-acquires-arroyo-pipelines-streaming-ingestion-beta/
Source: The Cloudflare Blog
Title: Just landed: streaming ingestion on Cloudflare with Arroyo and Pipelines

Feedly Summary: We’ve just shipped our new streaming ingestion service, Pipelines — and we’ve acquired Arroyo, enabling us to bring new SQL-based, stateful transformations to Pipelines and R2.

AI Summary and Description: Yes

Summary: The text announces the launch of Pipelines, a new streaming ingestion product designed for real-time data handling without requiring users to manage underlying infrastructure. It emphasizes the integration of this service with the R2 storage and the recent acquisition of Arroyo to enhance data transformation capabilities.

Detailed Description:
The text outlines several important aspects of the newly launched Pipelines product, which is geared towards data ingestion and processing. Here are the major points highlighted:

– **Streaming Ingestion**: Pipelines allows businesses to ingest high volumes of structured, real-time data efficiently, alleviating the need for users to manage backend infrastructure.

– **Acquisition of Arroyo**: The acquisition of Arroyo, a cloud-native distributed stream processing engine, enhances the Pipelines product by enabling users to transform and process data streams in real-time.

– **Integration with R2**: Pipelines seamlessly integrates with R2, Cloudflare’s object storage service, providing durable storage capabilities and allowing users to write their data efficiently for querying.

– **Ease of Use**: Developers can create data pipelines with simple commands, facilitating quick setup and minimized learning curves compared to established solutions like Apache Kafka.

– **High Throughput and Scalability**: Pipelines can handle up to 100,000 records per second, accommodating high data ingestion volumes and ensuring data availability during peak usage.

– **Focus on Operational Simplicity**: By using technologies like Durable Objects and SQLite, Pipelines ensures data is immediately persisted, optimizing both performance and user experience.

– **Future Enhancements**: The text hints at future enhancements including transformations, integration with additional data sources, and the introduction of user-defined functions. It also indicates plans for making Pipelines available on free usage tiers.

– **Pricing Structure**: Initially, there will be no additional charge for using Pipelines, but a pricing model based on data ingestion is expected in the future.

Overall, the launch of Pipelines is a significant step in making real-time data processing more accessible and manageable for professionals in data science and engineering, reflecting a trend towards position-based operation and integration in cloud services. The potential for integrating with advanced systems like Arroyo and object storage hints at a powerful future for data processing capabilities.

1 10 2 a access acquisition AI and Apache Apache Kafka Arroyo as availability backend based business by C capabilities CI Cloud cloud service cloud services cloud-native Cloudflare co command D data Data Handling Data Ingestion data pipeline data pipelines data processing data science data sources data transformation de DeFi design developer developers durable object Durable Objects e efficient end Engineer engineering exp experience fine for free function future g H high Highlight HP HR http HTTPS in infrastructure integration ite J Just k Kafka l land learning led Li Lite low making man media mini ML Mode model N native no o object storage of on operation opt out over performance Pipeline pipelines point potential Power pricing pricing model pricing structure process processing product product design professionals Q QUIC R R2 R2 storage rag rate RCE real real-time real-time data real-time data processing red Ro s s Position scalability science sec service services Sig Sim Simple simplicity solutions source sql sqlite SSE SSL state storage Streaming streaming ingestion structured system systems T tech technologies text the throughput Time time data time data handling time data processing to Tor TP transformation transformations UI under up US usage use user user experience Users V Wi x