Tail Latencies at Scale: When Percentiles mislead
What Individual Request Latencies tell us
Let’s look at the following percentiles calculated by looking at 100 requests
P50 | P75 | P90 | P95 | P99 | P999 | P9999 |
---|---|---|---|---|---|---|
1 | 4 | 4 | 5 | 9.9 | 27.9 | 29.8 |
If you are looking at this data from individual requests made to the server, you can conclude the following
- 95% of the users are served within 5 seconds
Access this course and 1400+ top-rated courses and projects.