The Cloudflare Blog: Eliminating Cold Starts 2: shard and conquer

Sep 26, 2025

—

Source URL: https://blog.cloudflare.com/eliminating-cold-starts-2-shard-and-conquer/
Source: The Cloudflare Blog
Title: Eliminating Cold Starts 2: shard and conquer

Feedly Summary: We reduced Cloudflare Workers cold starts by 10x by optimistically routing to servers with already-loaded Workers. Learn how we did it here.

AI Summary and Description: Yes

**Summary:**
The text discusses advancements in Cloudflare Workers to reduce cold start times through techniques such as pre-warming Workers during TLS handshakes and implementing a consistent hash ring for workload distribution. These innovations aim to enhance performance, reduce latency, and improve resource efficiency, which is crucial for developers and professionals working with serverless computing and cloud-based environments.

**Detailed Description:**
The content highlights the evolution of Cloudflare Workers technology, focusing on reducing cold start delays and optimizing request routing. It details a journey from initial cold start improvements to more sophisticated techniques that include sharding and dynamic request handling.

– **Cold Starts and Their Impact:**
– A cold start occurs when a Worker is instantiated for the first time, involving costly operations such as fetching and compiling source code.
– Originally, cold starts were faster than TLS handshakes, but as users deployed larger and more complex applications, this changed, necessitating the need for enhancement in Workers’ operational efficiency.

– **New Techniques Implemented:**
– **Worker Sharding:** A consistent hash ring mechanism is employed to optimize workload distribution across a global network, avoiding new cold starts by routing requests to already existing Worker instances whenever feasible.
– **Routing Efficiency:** The design aims to route requests not just based on availability but considering which instances can reduce cold starts, thereby maintaining a “warm request rate.”

– **Resource Management Improvements:**
– The system has dynamically increased resource limits (script sizes and CPU time) in response to user demand for more complex applications.
– This increase, while beneficial for application size, also raised cold start times, leading to new strategies aimed at reducing cold starts through better request management.

– **Innovation in Handling Requests:**
– Optimization techniques for managing requests include sharding based on prior cache mechanisms, reducing resource allocation while maintaining performance for popular workloads.
– Introduction of concepts like “shard client” and “shard server” aims to promote efficient multitasking and request management during peak loads.

– **Load Shedding Techniques:**
– To handle server overloads gracefully, two strategies are elaborated:
– Pre-emptively querying worker ability to handle requests (with potential added latency).
– Sending requests without prior confirmations and managing any failures efficiently.

– **Results and Metrics:**
– The efforts led to an impressive reduction in Worker eviction rates, improving system efficiency and achieving a dramatic increase in warm request rates, which enhances user experience significantly.
– Notably, enterprise traffic saw warm request rates improve from 99.9% to 99.99%, reflecting enhanced performance due to novel request routing strategies.

This text offers insightful implications for professionals in cloud and infrastructure security by demonstrating advanced methods of optimizing cloud workloads and resource management, which are fundamental to maintaining operational integrity and performance in highly dynamic serverless environments.

1 10 2 a Act ads advanced advancement advancements age AGI AI All and anti app Application applications ARM art as at ated availability based based environments Bi by C Cache cache mechanism cache mechanisms CI CIA cli client Cloud cloud workload cloud workloads cloud-based Cloudflare Cloudflare Workers co code Col cold starts Computing concept content cost CPU cross D de demand demo design developer developers dynamic request handling e efficiency efficient end enhanced performance enterprise environment environments ERP exp experience fail failures fast faster fetch first for full g glob Global global network Grace H handling hands handshake high Highlight HR http HTTPS impact implications improving improving system in infrastructure infrastructure security innovation Innovations Instance integrity io Iron IRS J Just k l Labor large latency leading led Li load load shedding M man management metrics multi multitasking N network new new strategies no o of off on ons operation operational operational efficiency operational integrity operations opt optimization optimization technique optimization techniques oS oss out over per performance phi potential pre pro professionals ps Q R Raise Rama rate RCE re ready red reduction request routing resource resource allocation resource efficiency resource management response Ro routing s sec security server serverless serverless computing serverless environment servers SHA sharding side Sig size sizes source source code SSE STAR start strategies system system efficiency T Tails Task tasking tech techniques technology ted text the Time times TLS to TP traffic two UN US use user user demand user experience Users V Wi workers workload workload distribution workloads x z