Demand Control

Learn how a system can crash under excess demand, how socket limits (including ephemeral sockets) constrain a service under heavy load, and how long response times lead users to retry.

System crash

In the old days of mainframes in glass houses, we could predict what the workload looked like from day to day. Operators would measure how many MIPS (millions of instructions per second) a given job needed. Those days are long gone. Most of our services are either directly or indirectly exposed to the entire world’s population.

Our daily reality is this: the world can crush our systems at any time. There’s no natural protection. We have to build it. There are two basic strategies: either refuse work or scale out. For the moment, we’ll consider when, where, and how to refuse work.
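
To make “refuse work” concrete, here’s a minimal sketch (not from the text) using the JDK’s built-in com.sun.net.httpserver. A Semaphore caps the number of requests in flight; anything past the cap gets an immediate 503 instead of waiting in a queue. The limit of 100, the port, the listen backlog of 64, and the thread pool size are all assumed numbers, purely for illustration.

```java
import com.sun.net.httpserver.HttpServer;
import java.io.IOException;
import java.io.OutputStream;
import java.net.InetSocketAddress;
import java.util.concurrent.Executors;
import java.util.concurrent.Semaphore;

public class RefuseWorkServer {
    // Assumed limit: at most 100 requests in flight; beyond that, shed load.
    private static final Semaphore inFlight = new Semaphore(100);

    public static void main(String[] args) throws IOException {
        // Second argument is the TCP listen backlog: another place to bound demand.
        HttpServer server = HttpServer.create(new InetSocketAddress(8080), 64);
        server.createContext("/", exchange -> {
            if (!inFlight.tryAcquire()) {
                // Over capacity: refuse immediately rather than letting queues back up.
                exchange.sendResponseHeaders(503, -1);
                exchange.close();
                return;
            }
            try {
                byte[] body = "ok\n".getBytes();
                exchange.sendResponseHeaders(200, body.length);
                try (OutputStream out = exchange.getResponseBody()) {
                    out.write(body);
                }
            } finally {
                inFlight.release();
            }
        });
        server.setExecutor(Executors.newFixedThreadPool(16));
        server.start();
    }
}
```

The point isn’t the specific numbers; it’s that the refusal happens early, before the extra request ties up a thread or a downstream connection.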

How systems fail

Every failing system starts with a queue backing up somewhere. When thinking about a request/reply workload, we need to consider the resources being consumed and the queues guarding access to those resources. That lets us decide where to cut off new requests. Each request obviously consumes a socket on each tier it passes through. While the request is active on an instance, that instance has one fewer ephemeral socket available for new requests. In fact, that socket remains consumed for a little while after the request completes. (See TIME_WAIT and the Bogons.)
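
Each of those queues is a place where we can impose a bound. As a sketch (with assumed numbers: eight workers, a queue depth of 32, and a 50 ms stand-in for real request handling), a Java ThreadPoolExecutor with a bounded queue and an AbortPolicy rejects new work the moment the queue fills, which is exactly the kind of cut-off point we’re looking for:

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

public class BoundedWorkQueue {
    public static void main(String[] args) {
        // Assumed sizing: 8 workers, at most 32 waiting requests. When the queue
        // is full, AbortPolicy rejects new work instead of letting it back up.
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                8, 8, 0L, TimeUnit.MILLISECONDS,
                new ArrayBlockingQueue<>(32),
                new ThreadPoolExecutor.AbortPolicy());

        for (int i = 0; i < 1_000; i++) {
            try {
                pool.execute(() -> {
                    try {
                        Thread.sleep(50); // stand-in for real request handling
                    } catch (InterruptedException e) {
                        Thread.currentThread().interrupt();
                    }
                });
            } catch (RejectedExecutionException e) {
                // This is the cut-off point: translate it into a 503 at the edge.
                System.out.println("request " + i + " refused: queue full");
            }
        }
        pool.shutdown();
    }
}
```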

Socket limitation

There’s a relationship between the number of sockets available and the number of requests per second our service can handle. That relationship depends on the duration of the requests. (They are related via Little’s law.) The faster our service retires requests, the more throughput it can handle. But we’re talking about systems under high levels of load. It’s natural to expect our service to slow down under heavy load, but that means fewer and fewer sockets are available to receive new requests exactly when the most requests are coming in. We call that “going nonlinear,” and we don’t mean it in a good way.
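
Little’s law says the average number of requests in the system equals the arrival rate times the average time each request spends in the system, L = λW. Turning that around, the throughput our sockets can sustain is the number of sockets divided by the request duration. A tiny sketch with assumed numbers shows how badly a slowdown cuts into that ceiling:

```java
public class LittlesLaw {
    public static void main(String[] args) {
        // Assumed numbers, purely illustrative.
        double availableSockets = 16_000;   // sockets an instance can devote to requests

        // L = lambda * W  =>  lambda = L / W
        double normalDuration = 0.25;       // seconds per request under light load
        double degradedDuration = 2.0;      // seconds per request under heavy load

        System.out.printf("max throughput at %.2fs/request: %.0f req/s%n",
                normalDuration, availableSockets / normalDuration);
        System.out.printf("max throughput at %.2fs/request: %.0f req/s%n",
                degradedDuration, availableSockets / degradedDuration);
        // Slowing from 0.25s to 2s cuts the ceiling from 64,000 to 8,000 req/s:
        // exactly when demand peaks, the capacity to accept new requests collapses.
    }
}
```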
