Distillation: The BYOL Algorithm
Learn about self-supervised learning via distillation and get an overview of the BYOL algorithm.
Distillation as similarity maximization
As shown in the figure below, distillation, in general, refers to transferring knowledge from a fixed (usually large) model, known as the teacher, to another model, known as the student.
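To make the idea of "similarity maximization" concrete, here is a minimal sketch of a distillation-style objective. It assumes the teacher and the student each map an input to an embedding vector and that similarity is measured with cosine similarity; the function names, the example vectors, and the choice of `1 - cosine` as the loss are illustrative assumptions, not something this lesson specifies.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def distillation_loss(student_out, teacher_out):
    """Assumed loss: smallest when the student's output points in the
    same direction as the fixed teacher's output."""
    return 1.0 - cosine_similarity(student_out, teacher_out)


# Hypothetical embeddings for one input. The teacher's output is treated
# as a constant target; only the student would be updated during training.
teacher_out = [1.0, 0.0]
aligned_student = [2.0, 0.0]     # same direction as the teacher
orthogonal_student = [0.0, 3.0]  # unrelated direction

loss_aligned = distillation_loss(aligned_student, teacher_out)
loss_orthogonal = distillation_loss(orthogonal_student, teacher_out)
```

Minimizing this loss with respect to the student's parameters is the same as maximizing the student's similarity to the teacher, so a student whose output already aligns with the teacher's incurs a lower loss than one whose output is orthogonal to it.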