Home / AI GameChanger / AI Foundations
🧠 AI Foundations

Supervised, Unsupervised & Reinforcement Learning

Beginner ⏱ 5 min read 📘 Lesson 3 of 33

Every ML problem falls into one of three buckets. Knowing which one you have decides everything that follows.

1. Supervised learning — learning from labelled examples

You have inputs and the correct answers. The model learns the mapping. This is 90% of applied ML.

  • Classification — predict a category. Spam/not-spam, disease/healthy, which digit is this?
  • Regression — predict a number. House price, tomorrow's temperature, CGPA.

2. Unsupervised learning — finding structure with no labels

You have data but no answers. The model finds hidden patterns.

  • Clustering — group similar things. Customer segments, news topics.
  • Dimensionality reduction — compress features while keeping meaning (PCA, embeddings).

3. Reinforcement learning — learning by trial and reward

An agent takes actions in an environment and gets rewards/penalties. It learns a strategy that maximises reward over time. Powers game-playing AI (AlphaGo), robotics, and the "RLHF" step that made ChatGPT helpful.

Problem: "Predict if this transaction is fraud"       -> Supervised (classification)
Problem: "Group these 1M users into segments"          -> Unsupervised (clustering)
Problem: "Teach a bot to win at chess"                 -> Reinforcement
Problem: "Estimate a used car's resale price"          -> Supervised (regression)

Decision rule: Do you have labelled answers? → Supervised. No labels, want groups/structure? → Unsupervised. An agent acting to maximise reward? → Reinforcement.