Lesson-03: RAS Design

RAS design

The jobs of RAS are to continuously optimize server assignments and provide guaranteed capacity to services during random and correlated failures, diverse capacity requests, heterogeneous hardware resources, and other such events.

A detailed architecture of RAS is explained in the following section.

Two-level architecture

Our goal is to isolate container placement from capacity allocation in order to enable region-wide optimizations for capacity allocation. The following figure shows the two-level architecture that describes the working of RAS and Twine.

The first level consists of two main components of RAS; the Async Solver and Online Mover. Async Solver uses Mixed-Integer-Programming (MIP) for region allocation and generates a Solver output. The Solver output is propagated to the Resource Broker that is going to perform the mapping between hardware resources that belong to certain MSBs to the service. This is the abstraction of reservation. If a failure occurs over the reservation and maintenance event then another component called the Mover is going to take a look at the reservation and move transparently the service to another allocated resource across another MSB.

Twine Allocator and Scheduler create the second level of the architecture. They place containers on the top of each reservation in real-time and manage its lifecycle through the process. The Health Check Service monitors all servers in the reservation.

A reservation represents a logical cluster consisting of resources that the Twine Allocator can use. Containers hosting services are placed on servers within a reservation. Reservations are grouped based on varying attributes such as the number of resources, hardware types, placement policies, and operating system (OS) configuration requirements.

RAS uses a concept called Relative Resource Unit (RRU) to conceal the number of heterogeneous hardware resources. A resource unit amount defined for each server represents the throughput of a particular server type. Upon request, a cumulative amount of resource units are reserved ...

Create a free account to access the full course.

By signing up, you agree to Educative's Terms of Service and Privacy Policy

Introduction

Abstractions

Non-functional System Characteristics

Back-of-the-Envelope Calculations

Building Blocks

Domain Name System (DNS)

Sequencer

Rate Limiter

Distributed Cache

Blob Store

Content Delivery Network (CDN)

Load Balancers

Key-Value Store

Distributed Messaging Queue

Pub-sub

Distributed Task Scheduler

Distributed Search

Distributed Logging

Distributed Monitoring

Monitoring Server Side Errors

Monitoring Client Side Errors

Databases

Sharded Counters

Concluding Building Blocks

Design YouTube

Design Quora

Design Google Maps

Designing a Proximity Server like Yelp

Design Uber

Design Twitter

Newsfeed System

Design Instagram

Design URL Shortening Service / TinyURL

Design a Web Crawler

Design WhatsApp

Design Typeahead Suggestion

Design Collaborative Document Editing Service / Google Docs

Spectacular Failures

Concluding Remarks

Appendix: System Design Interviews

All content below this will likely go away

Design Exercises

Archived temporary lessons

Design Resource Allocator for a Large Datacenter

Design Zoom

Continuous Monitoring using Data Processing

Design Live Commenting at Facebook

Security

For Noor: Placeholder for Illustration Making

Appendix

Backup of our Lessons

Caching Billions of Tiny Objects on Flash

Design Quora

Copy-Design YouTube

Identity & Access Management

Copy of CDN (02-03-2022)

Lesson-03: RAS Design

RAS design

Two-level architecture