The Cloudflare Blog: Quicksilver v2: evolution of a globally distributed key-value store (Part 2)

Jul 17, 2025

—

Source URL: https://blog.cloudflare.com/quicksilver-v2-evolution-of-a-globally-distributed-key-value-store-part-2-of-2/
Source: The Cloudflare Blog
Title: Quicksilver v2: evolution of a globally distributed key-value store (Part 2)

Feedly Summary: This is part two of a story about how we overcame the challenges of making a complex system more scalable.

AI Summary and Description: Yes

Summary: The text describes the evolution of Cloudflare’s Quicksilver, a key-value database, highlighting its transition from a simple storage model to a more sophisticated, tiered caching architecture. This evolution addresses scalability and performance issues as the database size and request rates dramatically increase, underscoring the importance of efficient data access in cloud infrastructure.

Detailed Description:

The provided text details the advancements and challenges faced by Cloudflare with its Quicksilver database, illustrating significant aspects relevant to cloud computing infrastructure, particularly in terms of scalability, efficiency, and data management.

Key Insights and Major Points:

– **Quicksilver Overview**:
– Quicksilver is a key-value database essential for managing configuration data across Cloudflare’s global servers.
– It handles vast request volumes with a current capacity of over three billion keys per second.

– **Scalability Challenges**:
– Initially, Quicksilver stored all key-value pairs on every server, which quickly led to disk space limitations.
– With rapid data growth, the architecture needed to adapt to prevent performance degradation and excessive costs related to disk space.

– **Architecture Evolution**:
– **Version 1.5**: Introduced a proxy mode, where servers only held cached data rather than the full dataset. This initially reduced disk space usage but did not achieve balanced data distribution and still faced scalability issues.
– **Version 2**: Implemented a more refined architecture that utilizes tiered storage with specific focus on caching:
– **Level 1**: Local cache for frequently accessed data on each server.
– **Level 2**: Sharded, data center-wide cache to further optimize data retrieval.
– **Level 3**: Centralized replicas containing the complete dataset, utilized only when necessary.

– **Prefetching and Optimization Techniques**:
– Analyzing query patterns led to the implementation of reactive prefetching, which improved cache hit rates significantly by populating caches with data likely to be requested in the near future.
– The team observed that smaller data centers required less cache storage, leading to strategic enhancements in data locality through sharding.

– **Results and Performance Improvements**:
– With the new architecture, cache hit rates improved dramatically, reaching above 99.99% for worst-performing instances.
– The migration process was carried out smoothly without service interruptions, highlighting Cloudflare’s commitment to operational excellence.

– **Lessons Learned**:
– The development journey from Quicksilver V1 to V2 emphasizes the importance of iterative learning, strategic planning, and the need to design systems flexibly with an eye toward potential rollback.

The case of Quicksilver thus provides valuable insights for professionals engaged in cloud computing, infrastructure management, and system optimization, showcasing how to effectively tackle scalability within dynamic data environments while maintaining high availability and performance.

1 2 3 5 a access Act addresses advancement advancements AGI AI air and API Arch architecture art as at ated availability Bi by C Cache caching capacity cell centers challenge challenges CI Cloud cloud computing cloud computing infrastructure cloud infrastructure Cloudflare co commit Computing computing infrastructure Configuration cost Costs cross Current D data data access data center data centers data locality data management data retrieval database dataset de design development e effective efficiency efficient Entra environment event Excel face fine for full future g Global growth H high high availability Highlight HR http HTTPS implementation in infrastructure infrastructure management insights Instance inter io Iron IRS issue ite J k Key key-value pairs keys l Lance leading learning led level Level 2 Li limitations local M making man management migration migration process Mode model N new no o of on only operation operational excellence OPM opt optimization optimization technique optimization techniques ory oS out over Pair patterns per performance performance degradation performance improvement performance improvements performance issues phi planning point potential pre pro process professionals proxy ps Q QUIC R rag Rama rate RCE react reactive red retrieval Ro row s scalability scalable sec server servers service service interruptions SHA sharding Sig Sim Simple size small source specific SSE SSO storage stored strategic strategic planning system system optimization systems T Tails team tech techniques ted text the to Tor TP transition trie two UI under up US usage V val value store version Wi worst x z