Jargon is the biggest barrier to entry. Here are the terms that unlock 90% of AI articles and docs โ one honest line each.
Core ML
- Model โ the learned function that maps inputs to outputs.
- Weights / parameters โ the numbers inside the model that get tuned during training. "70B parameters" = 70 billion of them.
- Feature โ one input variable (age, pixel, word).
- Label / target โ the correct answer you train against.
- Training / inference โ learning the weights vs. using the trained model to predict.
- Epoch โ one full pass through the training data.
- Batch โ a chunk of examples processed together.
- Learning rate โ how big a step each update takes.
- Loss โ a number measuring how wrong the model is.
- Overfitting โ memorising the training data instead of learning the pattern (great on train, bad on new data).
- Generalisation โ performing well on data it has never seen. The actual goal.
Deep learning
- Neural network โ layers of simple units (neurons) that transform data.
- Activation function โ adds non-linearity so networks can learn complex shapes (ReLU, sigmoid).
- Backpropagation โ the algorithm that computes gradients through all layers.
- GPU / TPU โ hardware that does the massive matrix math fast.
- CNN / RNN / Transformer โ architectures for images / sequences / (now) everything.
LLM / GenAI
- Token โ a chunk of text (~ยพ of a word) the model reads and predicts.
- Embedding โ text/data turned into a vector of numbers that captures meaning.
- Prompt โ the input you give an LLM.
- Context window โ how many tokens the model can "see" at once.
- Hallucination โ the model stating something false but confident.
- Fine-tuning โ further-training a base model on your specific data.
- RAG โ retrieving relevant documents and feeding them to the LLM for grounded answers.
- Temperature โ randomness of the output; 0 = deterministic, high = creative.
Bookmark this โ you will return to it. Deep dives on each in the tracks below.