What is a task scheduler?

A task is a unit of computational work requiring resources (CPU, memory, storage, or bandwidth) for a specific duration. For example, uploading media to Facebook or Instagram triggers several background tasks:

Encoding the photo or video into multiple resolutions.
Validating the media for content monetizationContent monetization is a way of leveraging content so that a service can profit from it as users consume it., copyrights, and other policies.

Although these tasks are required to fully process and distribute the content, the upload request returns immediately. Compute-intensive processing runs asynchronously in the background, keeping the user-facing workflow responsive. When a user posts a comment on Facebook, the UI updates optimistically before backend confirmation completes. The distribution of that comment to followers is handled asynchronously by background workers coordinated through a task scheduler.

In any system, tasks compete for limited resources. A task scheduler mediates this competition by intelligently allocating resources to ensure both task-level and system-level goals are met.

When to use a task scheduler

Task schedulers are critical for efficiency. They enable systems to handle high workloads with limited resources, ensuring optimal utilization and uninterrupted executionRunning periodic tasks without user intervention to initiate the task execution again and again.. Common use cases include:

Single-OS nodes: Local OS schedulers use multi-feedback queues to allocate CPU time to competing processes on a single machine.
Cloud computing services: Cloud environments manage billions of tasks from multiple tenants across distributed resources. A local OS scheduler cannot scale to this level; a distributed solution is required to efficiently manage resources across many machines.
Large distributed systems: Platforms like Facebook or Instagram generate billions of asynchronous requestsAsynchronous requests are those that are not on a client’s critical path and often have some delay tolerance. Users request these tasks, and the system completes the tasks while the requester does not need to wait for the completion of the work. Notifications about the final state of the task are communicated to the requester some time in the future. from user interactions. These systems rely on distributed schedulers to process tasks, such as notifications or feed updates, without blocking the user experience.

Note: Facebook uses its own distributed task scheduler, called Async. It prioritizes tasks based on urgency. For example, live stream notifications require low-latency execution, while friend suggestion jobs can run with lower priority and relaxed scheduling constraints.

Distributed task scheduling

Task scheduling is the process of assigning resources to tasks promptly. The illustration below contrasts an OS-level scheduler with a data center-level scheduler:

System Design: The Distributed Task Scheduler

What is a task scheduler?

When to use a task scheduler

Distributed task scheduling

How will we design a task scheduling system?