Ad CTR Prediction: Problem Framing & Requirements

Explore how to effectively frame the ad click-through rate prediction problem by anchoring your design in auction principles, business success metrics, and strict latency requirements. This lesson guides you through balancing accuracy, calibration, and engineering constraints to build scalable, production-ready ML systems for ad ranking.

We'll cover the following...

The eCPM equation and auction mechanics
Business metrics that define success
- Three stakeholders, three metric pillars
- Mapping offline metrics to business outcomes
The sub-100ms latency constraint
- Where the time goes
  - Cascading design implications
  - The data freshness challenge
Scoping by seniority level
Setting the stage for data and features

Every time a user scrolls through a feed on Meta, searches on Google, or swipes on TikTok, an ad auction fires in single-digit milliseconds. The ML model powering that auction predicts whether the user will click a given ad. A fraction-of-a-percent improvement in that prediction shifts billions of dollars in annual revenue, determines whether advertisers stay or leave the platform, and shapes the quality of every user’s experience. This is why ad click-through rate (CTR) prediction is one of the most frequently asked ML system design problems at MAANG companies.

The problem touches every pillar of system design simultaneously. Data pipelines must ingest billions of impression logs daily. Feature engineering must handle sparse, high-cardinality categorical data. Model architectures like Wide & Deep and DeepFM must balance memorization with generalization. Serving infrastructure must return predictions under brutal latency constraints. And continuous training pipelines must keep the model fresh as user behavior shifts hour by hour.

This lesson focuses on the critical first step that separates strong interview candidates from weak ones: problem framing and requirements. Before proposing any model or architecture, you need to anchor your design in three things: the auction math, the business metrics, and the latency constraint. Let’s build that foundation.

The eCPM equation and auction mechanics

The entire ad ranking system rests on a single equation:

1.The Interview Framework and Communication

2.Problem Formulation and Requirements

3.Data Strategy: Collection, Pipelines, and Features

4.Model Design and Architecture Selection

5.Evaluation: Offline, Online, and Fairness

6.Serving, Deployment, and MLOps

7.Case Study: Video Recommendation System

8.Case Study: Social Feed Ranking System

9.Case Study: Ad Click-Through Rate Prediction System

Mock Interview

10.Case Study: Semantic Search Engine

11.Case Study: Content Moderation System

Mock Interview

12.Case Study: Object Detection System

Mock Interview

13.Case Study: Visual Search System

Mock Interview

14.Case Study: Fraud Detection System

Mock Interview

15.Case Study: RAG-Based Enterprise Knowledge Assistant

16.Case Study: LLM-Powered Code Generation Tool

Ad CTR Prediction: Problem Framing & Requirements

The eCPM equation and auction mechanics