Inference
Explore the process of inference in machine learning, focusing on scaling prediction workloads using aggregators and worker pools. Understand strategies for serving multiple models, managing data distribution shifts, and applying techniques like Thompson Sampling to balance exploration and exploitation in dynamic environments.
Inference is the process of using a trained machine learning model to make predictions. Below are some techniques for scaling inference in a production environment.
1. Imbalanced workloads
- During inference, a common pattern is to split the workload across multiple inference servers, much like the architecture used by load balancers. This component is sometimes called an Aggregator Service.
- Clients (upstream processes) send requests to the ...
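The aggregator pattern above can be sketched as follows. This is a minimal, hypothetical illustration: the `predict` function stands in for a real model's forward pass, and the round-robin dispatch policy is one simple choice among many (a production aggregator might use load-aware routing instead).

```python
# Minimal sketch of an aggregator fanning requests out to a worker pool.
# `predict` is a placeholder (an assumption) for a real model inference call.
from concurrent.futures import ThreadPoolExecutor
import itertools


def predict(worker_id, request):
    # Placeholder for a model forward pass on one inference server.
    return {"worker": worker_id, "input": request}


class Aggregator:
    """Round-robin dispatcher over a fixed pool of inference workers."""

    def __init__(self, num_workers=4):
        self.pool = ThreadPoolExecutor(max_workers=num_workers)
        self._ids = itertools.cycle(range(num_workers))

    def handle(self, request):
        # Pick the next worker and submit the request asynchronously.
        worker_id = next(self._ids)
        return self.pool.submit(predict, worker_id, request)


agg = Aggregator(num_workers=2)
futures = [agg.handle(f"req-{i}") for i in range(4)]
results = [f.result() for f in futures]
print([r["worker"] for r in results])  # requests alternate between the two workers
```

Because the aggregator returns futures, clients can issue many requests concurrently and collect results as they complete, which is what makes this pattern scale horizontally.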