System Design Deep Dive: Real-World Distributed Systems/

...

Detailed Design of a Many-core System

Learn the detailed design of a many-core key-value store.

We'll cover the following...

Solution: Limitations of memory and Memcached
Solution: Core task allocation
- Metrics
- Evaluating parametric space
Summary

Now, we will look into the detailed design of the solution to the problems we have while implementing a key-value store on a many-core system. We'll be discussing the solution to these three problems:

Memory limitation of TilePro64
Limitations of a multi-threaded Memcached
Allocation of tasks to cores

Solution: Limitations of memory and Memcached

Our first challenge is that TilePro64 has a 32-bit instruction set, and we cannot assign more than 4GB of virtual address space to a single process. The second challenge identified how the use of global locks hurt the performance of our key-value store. Both problems can be solved by implementing a version of Memcached that supports multiple processes:

Use multiple processes to access the key-value store in their own address space and overcome the memory limitation.
Use data shardingBreaking data into smaller chunks. (a direct consequence of using multiple processes) to parallelize data access and stop the use of locks within a shard.

Use multiple processes

When using domain-specific processors, we run into unique problems like the TilePro64 having only a 32-bit virtual address space. So, rather than using a single process that uses multiple threads, we will use those threads to communicate with independent processes that contain the key-value data shard in their own dedicated address space. This will allow us to overcome the limitation of the 32-bit virtual address space, and each process will run its operations in serial while ...

Prologue

File Systems

Google File System (GFS)

Google Colossus File System

Facebook's Tectonic File System

Databases

Google Bigtable

Google Megastore

Google Spanner

Key-value Stores

Many-core Key-value Store

Scaling Memcache

SILT

Amazon DynamoDB

Concurrency Management

Two-phase Locking (2PL)

Google Chubby Locking Service

ZooKeeper

Big Data Processing: Batch to Stream Processing

MapReduce

Spark

Kafka

Consensus

Understanding Consensus: Two Generals, FLP, & Byzantine Generals

Two-phase Commit

State Machine Replication

Paxos

Raft

Epilogue

Detailed Design of a Many-core System

Solution: Limitations of memory and Memcached

Use multiple processes