Detailed Design of a Distributed Cache

This lesson identifies limitations in the high-level design and refines the architecture to address them.

Find and remove limitations

Before we get to the detailed design, we must resolve three specific challenges:

Service discovery: Cache clients have no mechanism to detect when cache servers are added or fail.
SPOF and performance: Storing a dataset on a single server creates a single point of failure (SPOF). Additionally, frequently accessed keys (hot keys) can overload that one node and degrade performance.
Server internals: The design lacks details regarding internal data structures and eviction policies.
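
The first two limitations can be made concrete with a small sketch. It assumes, purely for illustration, that each client holds a fixed list of server names and maps every key to exactly one server with a hash-modulo scheme. The server names, the `pick_server` and `load_per_server` helpers, and the workload are all hypothetical and are not taken from the high-level design.

```python
import hashlib
from collections import Counter

# Hypothetical static server list baked into the client. Nothing updates it
# when a server is added or fails, which is the service discovery gap.
SERVERS = ["cache-0", "cache-1", "cache-2"]


def pick_server(key, servers=SERVERS):
    """Map a key to exactly one server (assumed hash-modulo placement)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return servers[int(digest, 16) % len(servers)]


def load_per_server(requests, servers=SERVERS):
    """Count how many requests each server would receive."""
    return Counter(pick_server(key, servers) for key in requests)


# Skewed workload: one hot key accounts for 90 of 100 requests.
requests = ["user:42"] * 90 + [f"user:{i}" for i in range(10)]
load = load_per_server(requests)

# Every request for the hot key lands on the same node. If that node fails,
# the client keeps routing to it because its server list never changes.
hot_server = pick_server("user:42")
```

In this sketch, `load[hot_server]` is at least 90 no matter how the other keys are spread, so one node absorbs most of the traffic and is also the only holder of that key.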

We will address the service discovery problem first. The following ...