Design of a Key-Value Store

We'll cover the following...

Requirements
- Functional requirements
- Non-functional requirements
Assumptions
API design
- Data type

Requirements

We will design a key-value store that addresses the limitations of traditional databases.

Functional requirements

While standard key-value stores offer get and put operations, this design focuses on specific characteristics:

Configurable service: Applications often trade strong consistency for higher availability. The system must support configurable consistency models that allow users to balance availability, consistency, cost, and performance.

Note: These configurations are set when instantiating a key-value store instance and cannot be changed dynamically during operation.

Ability to always write: Applications must always be able to write to storage. This prioritizes availability over consistency (choosing “A” over “C” in the CAP theorem).

Note: Whether a requirement is classified as functional or non-functional depends on context. For example, in a shopping cart system, the ability to accept writes at all times (high availability) can be treated as a non-functional requirement. Following the Dynamo design principles, we treat “always write” as a functional requirement in this context.

Hardware heterogeneity: The system must seamlessly integrate new servers with different capacities without upgrading existing ones. Workload distribution should align with server capacity, favoring a peer-to-peer design without distinguished nodes.

Non-functional requirements

The system must meet the following non-functional requirements:

Scalability: The system must support tens of thousands of servers globally. Incremental scalability is essential, allowing servers to be added or removed with minimal service disruption.
Fault tolerance: The system must operate uninterrupted despite server or component failures.

Data type

The key serves as the unique identifier, while the value can be any arbitrary binary data.

Note: Dynamo uses MD5 hashes to generate a 128-bit identifier for the key. These identifiers determine which server node is responsible for the specific key.

The next lesson covers the key-value store design, with an emphasis on scalability, replication, and versioning. We start with non-functional requirements, since the chosen scalability strategy shapes how functional requirements are implemented.

Note: This chapter is based on DynamoAmazon’s Highly Available Key-value Store (https://assets.amazon.science/ac/1d/eb50c4064c538c8ac440ce6a1d91/dynamo-amazons-highly-available-key-value-store.pdf), an influential work in the domain of key-value stores.

Parameter	Description
`key`	It’s the `key` against which we want to get `value`.

Parameter	Description
`key`	It's the `key` against which we have to store `value`.
`value`	It's the object to be stored against the `key`.

Design of a Key-Value Store

Requirements

Functional requirements

Non-functional requirements

Assumptions

API design

Data type