Introduction to Masked Image Modeling

What is masked image modeling?

Inspired by masked language modeling in NLP, masked image modeling (MIM) is a recent self-supervised learning paradigm in which part of the input image is masked and the model uses the remaining visible portion to predict the masked signal. This approach achieves results competitive with other self-supervised approaches such as contrastive learning.
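To make the idea concrete, here is a minimal sketch of the masking step and the reconstruction objective, written with NumPy. It is an illustration under stated assumptions, not the method of any particular MIM paper: the function names (random_patch_mask, masked_mse), the patch size, the 50% mask ratio, and the mean-squared-error loss on raw pixels are all hypothetical choices made for this example. A real model would replace the trivial baseline predictor with a trained network.

```python
import numpy as np

def random_patch_mask(image, patch_size=4, mask_ratio=0.5, seed=0):
    """Split an (H, W, C) image into non-overlapping patches and hide a random subset.

    Hypothetical helper for illustration; patch_size and mask_ratio are assumed values.
    """
    h, w, c = image.shape
    assert h % patch_size == 0 and w % patch_size == 0, "image must tile evenly"
    gh, gw = h // patch_size, w // patch_size

    # (gh, p, gw, p, c) -> (gh, gw, p, p, c) -> (num_patches, p * p * c)
    patches = (
        image.reshape(gh, patch_size, gw, patch_size, c)
        .transpose(0, 2, 1, 3, 4)
        .reshape(gh * gw, -1)
    )

    # Choose which patches to hide; True marks a masked patch.
    rng = np.random.default_rng(seed)
    num_masked = int(round(mask_ratio * gh * gw))
    mask = np.zeros(gh * gw, dtype=bool)
    mask[rng.choice(gh * gw, size=num_masked, replace=False)] = True

    visible = patches[~mask]   # what the model is allowed to see
    targets = patches[mask]    # what the model must predict
    return visible, targets, mask

def masked_mse(predictions, targets):
    """Reconstruction loss computed on the masked patches only (assumed objective)."""
    return float(np.mean((predictions - targets) ** 2))

# Toy 8x8 RGB image with random pixel values.
image = np.random.default_rng(1).random((8, 8, 3))
visible, targets, mask = random_patch_mask(image)

# Stand-in for a model: predict the mean visible patch for every masked patch.
baseline = np.repeat(visible.mean(axis=0, keepdims=True), targets.shape[0], axis=0)
loss = masked_mse(baseline, targets)
print(visible.shape, targets.shape, loss)
```

The key design point is that the loss is computed only on the masked patches, so the model gets no credit for copying what it can already see.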

