Fraud Detection: Problem Framing & Requirements

Explore how to frame fraud detection as a rare event problem with extreme class imbalance and asymmetric costs. Understand the importance of precision-recall metrics, the sub-50ms latency constraint, and architectural trade-offs. Gain insights into meeting interview expectations from mid-level to staff experts by aligning model design with real-time business and system requirements.

We'll cover the following...

Fraud as a rare event detection problem
Asymmetric cost structure and business metrics
- Defining the core business metrics
- Mapping asymmetry to model design
The sub-50ms latency constraint
- Why 50 milliseconds matters
  - Decomposing the latency budget
  - Architectural implications
Scoping by seniority: L4, L5, and Staff+
Summary

In autonomous driving, the system must detect obstacles in real time to prevent collisions. Financial fraud detection shares the same high-stakes, low-latency DNA but operates in a fundamentally different domain. Instead of LiDAR point clouds and pedestrian bounding boxes, the inputs are transaction amounts, device fingerprints, and behavioral signals. Instead of a car swerving into a guardrail, the failure mode is a stolen credit card draining thousands of dollars in seconds, or a legitimate customer blocked mid-checkout and lost forever.

Fraud detection is one of the most frequently tested ML system design problems in MAANG interviews, drawn directly from production systems at Stripe, PayPal, and major banks. Interviewers use it because it forces candidates to reason beyond model accuracy and into the territory of real-time serving infrastructure, business-metric alignment, and adversarial dynamics. A candidate who treats fraud detection as a simple binary classification task will miss the point entirely.

The core tension is deceptively simple. The system must catch fraudulent transactions while allowing legitimate commerce to flow unimpeded, all within milliseconds. This lesson frames that tension precisely. We will define fraud as a rare event detection problem, analyze the asymmetric cost structure that governs every design decision, establish the hard sub-50ms latency constraint, and close with a scoping comparison across L4, L5, and Staff+ interview expectations.

Fraud as a rare event detection problem

Legitimate transactions outnumber fraudulent ones by ratios of 1000:1 or higher in most production payment systems. This extreme class imbalanceA condition where one class (here, fraud) is vastly underrepresented relative to the other class (legitimate transactions) in the training and evaluation data. is not a minor inconvenience. It is the defining characteristic of the problem.

Consider what happens when you evaluate a model using standard accuracy. If only 0.05% of transactions are fraudulent, a model that labels every single transaction as legitimate achieves 99.95% accuracy. It catches exactly zero fraud. Accuracy is meaningless here.

Attention: In interviews, stating “we’ll optimize for accuracy” on a fraud detection problem is an immediate red signal. It tells the interviewer you haven’t internalized the class imbalance constraint.

The base rateThe natural frequency of the positive class (fraud) in the overall population of transactions. of fraud in the dataset determines which evaluation metrics are informative. Two metrics dominate this regime:

AUC-ROC: Measures the model’s ability to discriminate between classes across all threshold settings, but can appear optimistically high when the negative class overwhelms the positive class.
Precision-recall AUC (AUC-PR): Focuses evaluation on the minority class by measuring the trade-off between precision (of flagged transactions, how many are actually fraud) and recall (of all fraud, how many did we catch), making it the preferred metric for rare event detection.

Class imbalance is not just an evaluation problem. It cascades into every downstream design decision. The training pipeline must employ sampling strategies such as SMOTE or class-weighted loss functions. Threshold tuning becomes a core design concern rather than an afterthought. Monitoring must track per-class ...

1.The Interview Framework and Communication

2.Problem Formulation and Requirements

3.Data Strategy: Collection, Pipelines, and Features

4.Model Design and Architecture Selection

5.Evaluation: Offline, Online, and Fairness

6.Serving, Deployment, and MLOps

7.Case Study: Video Recommendation System

8.Case Study: Social Feed Ranking System

9.Case Study: Ad Click-Through Rate Prediction System

Mock Interview

10.Case Study: Semantic Search Engine

11.Case Study: Content Moderation System

Mock Interview

12.Case Study: Object Detection System

Mock Interview

13.Case Study: Visual Search System

Mock Interview

14.Case Study: Fraud Detection System

Mock Interview

15.Case Study: RAG-Based Enterprise Knowledge Assistant

16.Case Study: LLM-Powered Code Generation Tool

Fraud Detection: Problem Framing & Requirements

Fraud as a rare event detection problem