Self-Supervised Learning

A training paradigm where AI models learn from unlabeled data by creating their own supervisory signals, such as predicting masked words or future frames in video.

Yann LeCun has called self-supervised learning the key to unlocking AI's potential. By deriving supervisory signals from the data itself, it eliminates the need for expensive manual labeling, which matters because less than 1% of the world's data is labeled. Both BERT (masked language modeling) and GPT (next-token prediction) are self-supervised approaches, and the technique has enabled training on internet-scale datasets of trillions of tokens. Self-supervised pre-training followed by supervised fine-tuning has become the dominant recipe for building state-of-the-art AI systems.
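The core idea can be shown concretely: next-token prediction turns raw text into training pairs with no human labels, because each token's "label" is just the token that follows it. A minimal sketch, using a toy whitespace tokenizer (real systems use learned subword tokenizers):

```python
def next_token_pairs(text):
    """Turn an unlabeled string into (context, target) training pairs.

    No manual labeling is needed: the supervisory signal for each
    position is simply the next token in the raw data itself.
    """
    tokens = text.split()  # toy tokenizer; real models use subword units
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]

pairs = next_token_pairs("the model predicts the next token")
for context, target in pairs:
    print(context, "->", target)
# e.g. ['the'] -> model, ['the', 'model'] -> predicts, ...
```

Masked language modeling (as in BERT) works the same way in spirit: instead of predicting the next token, random tokens are hidden and the model is trained to recover them from the surrounding context.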
