AI Zero to Hero

Most AI content online is either too technical ("here's a 400-line PyTorch training loop") or too hand-wavy ("AI is like a brain, but digital!"). Neither helps if you just want to understand what's actually going on.

This course is the thing in between.

It's a structured 33-post journey from "what even is AI?" to "I can reason about agents, alignment, and where this is all headed." It's written for anyone curious — not just developers. No math beyond arithmetic. No coding required. The focus is on mental models, not tools.

If you can read a news headline and wish you actually understood it, you're in the right place.

What this course is

Concept-first. Every post starts with an analogy and a diagram before it introduces a technical term.
Plain English. Every ML term is defined the first time it appears, in ten words or less.
Visually heavy. Hand-drawn Mermaid diagrams on every post, using a consistent palette.
Short reads. 1500-2500 words per post. One coffee, one post.
A throughline. 33 posts across 6 modules, each building on the last. Skim the map below.

What this course is not

Not a coding tutorial. If you want to build a transformer from scratch in NumPy, this isn't it. (Come back after — you'll get way more out of Karpathy's zero-to-hero series.)
Not an academic reference. If it doesn't change how you think, it doesn't make the cut.
Not a video course. Every post is a self-contained read.
Not gatekept. If you're curious, you're ready.

The map

Six modules. Thirty-three posts. One story.

Module 1 — What AI Actually Is (live now)

The fog-clearing module. Start here even if you think you know.

What AI actually is (and isn't) — The four nested circles (AI, ML, DL, LLM) and one honest definition you can use every day.
A short, honest history of AI — Seventy years of rollercoaster: Dartmouth, two winters, AlexNet, ChatGPT.
The three tribes of AI — Symbolic, connectionist, Bayesian — and why every real system is a collaboration.
Why data changed everything — The real reason AI finally worked in 2012 (spoiler: not better math).
How to spot AI hype — A five-question field guide for every AI product pitch you'll hear this year.

Time investment: ~40 minutes total. Five coffees.

Module 2 — How Machines Learn (live now)

The mental model for "learning." No calculus, no code — just the one move behind every spam filter, every chatbot, and every game-playing AI.

What "learning" actually means for a machine — Machines don't learn, they fit. The one metaphor behind everything else in this module.
Supervised learning, in one story — A toddler in a park is doing exactly what a spam filter does. Told as one story.
Unsupervised learning, or why patterns appear on their own — Clustering, similarity, and embeddings — built from a party you walked into.
Reinforcement learning — the dog-treat model of learning — Why every game-playing AI, and the final polish on every chatbot, come from the same puppy trick.
Overfitting, underfitting, and why your model lies to you — The single most important failure mode in ML, told through a student who memorised the practice test.
Features, labels, and the data diet — Why "garbage in, garbage out" isn't a slogan — it's the whole story.

Time investment: ~55 minutes total. Six coffees.

Module 3 — Neural Networks, Demystified (live now)

What a neural network actually is, told without ever asking you to compute a derivative. No calculus. No code. Six metaphors stacked on top of each other.

The neuron is a lie (and that's fine) — An artificial neuron is not a brain cell. It's a tiny voting committee. That swap unlocks everything else.
Why stacking layers works — Depth is a ladder of abstractions. Strokes become letters, letters become words, words become meaning. The single most important idea in deep learning.
Training a neural network — rolling downhill, blindfolded — Gradients and backprop, told as a blindfolded walk down a crumpled hillside. One tiny step at a time.
CNNs — looking at images with a flashlight — The architecture that cracked computer vision open in 2012. One little flashlight that slides across the image, looking for the same pattern everywhere.
RNNs — reading one word at a time — The old-school way of handling sequences. A neural network with a notebook. Why it worked, why it couldn't last.
Embeddings — turning everything into a point in space — The single most important idea in modern AI that isn't "neural network." Words, pictures, songs, users — all points on one invisible map.

Time investment: ~65 minutes total. Six coffees.

Module 4 — The Transformer Revolution (live now)

The architecture behind every LLM you've heard of. Attention, the block, the scroll, the three-act training story, and the bitter lesson that drove the whole industry to "just make it bigger."

Attention, explained without equations — Every word in a sentence walks into a library, glances at every other word, and copies the relevant content into its own notebook. That's the whole revolution.
The transformer, piece by piece — Inside one transformer block: attention, feed-forward, residual connections. Stack them and you get ChatGPT.
Tokens, context windows, and why length matters — LLMs don't read words, they read chunks. The scroll they can see at once is finite, and it explains half the weird behaviour you've noticed.
Pretraining, finetuning, RLHF — the three-act training story — How a raw language model becomes a polite assistant. Three acts, three teachers, one shipped chatbot.
Scaling laws and the bitter lesson — Why bigger keeps beating cleverer. The most uncomfortable truth in AI research, and the force behind every billion-dollar training run.

Time investment: ~55 minutes total. Five coffees.

Module 5 — Living with LLMs (live now)

The practical module. How LLMs actually generate, how to prompt them well, when to ground them with retrieval, the fine-tune vs prompt vs retrieve decision, how to evaluate, and the four characteristic failure modes.

What an LLM is actually doing — token by token — The five-step loop every chatbot runs. Why it hallucinates, why it can't plan, why it pauses.
Prompting as a skill, not a trick — Eight principles grouped into three families. The patterns that keep working as models change.
RAG — giving the model a library card — Retrieval-augmented generation as open-book exam. When it helps, when it fails, what to watch for.
Finetune vs prompt vs retrieve — when to use what — The decision framework nobody writes down. Prompt fixes instructions, retrieval fixes facts, fine-tuning fixes behaviour.
Evaluation — how do you know an LLM is any good? — Benchmarks, golden sets, human rating, LLM-as-judge, and vibes. An honest look at the least-solved problem in LLM work.
The failure modes of LLMs — Hallucination, sycophancy, jailbreaks, bias. What causes each one, what you can do about each one, and what you can't.

Time investment: ~65 minutes total. Six coffees.

Module 6 — AI in the Wild (live now)

The "so what now?" module. The field zoomed out: what AI is going out into the world as, what to worry about, and how to keep learning after this course ends.

Agents — LLMs that can do things — A chatbot with tools and a loop. Why the intelligence is the easy part and reliability is the hard part.
Multimodal AI — models that see, hear, and act — One encoder per modality, one shared embedding space, one set of operations. How a sentence, an image, and a song all become points on the same map.
The alignment problem, honestly — No doom, no dismissal. What serious researchers actually worry about and what gets overstated.
AI and your job — the honest answer — Your job is a bundle of tasks. AI is reshaping which parts are cheap and which are valuable. Reshape or get left behind.
Where this all goes next — and how to keep learning — The final post of the course. Frontier research, realistic forecasts, and an opinionated guide to staying curious.

Time investment: ~50 minutes total. Five coffees.

How to read this course

In order, if you can. Each module builds on the last.
One post at a time. They're designed to be read over coffee, not binged.
Out loud sometimes. The analogies stick better that way.
Skeptically. If a post doesn't change how you see something, leave a comment — I want to know.

You don't need to take notes. You don't need to do anything between posts. The goal isn't to memorise terms; it's to build a mental map that sticks.

When you're ready, start here:

👉 M1.1 — What AI actually is (and isn't)

A note on what you'll find elsewhere in this blog

If you poke around, you'll find some older posts that go deeper into code — things like neural networks from scratch, training loops, PyTorch examples. Those are a separate, more developer-focused track. They're still valuable if you want that — but the main AI Zero to Hero experience is the track you just saw above. No code required.

Cover photo via Unsplash.

AI Zero to Hero — Start Here

What this course is

What this course is not

The map

Module 1 — What AI Actually Is (live now)

Module 2 — How Machines Learn (live now)

Module 3 — Neural Networks, Demystified (live now)

Module 4 — The Transformer Revolution (live now)

Module 5 — Living with LLMs (live now)

Module 6 — AI in the Wild (live now)

How to read this course

A note on what you'll find elsewhere in this blog

Comments

A Short, Honest History of AI — From Turing to ChatGPT

More from this blog

A Reading List and Two Habits: Staying Current in Ten Minutes a Week

What to Decide Now, What to Defer, What to Ignore: The AI Action Matrix

The Next 18 Months of AI: A Calibrated Leader's Forecast

Calibrating Your AI Exposure: Upside and Downside in One Matrix

Five AI Capabilities That Matter for Your Business, and Five That Do Not

Command Palette

What this course is

What this course is not

The map

Module 1 — What AI Actually Is (live now)

Module 2 — How Machines Learn (live now)

Module 3 — Neural Networks, Demystified (live now)

Module 4 — The Transformer Revolution (live now)

Module 5 — Living with LLMs (live now)

Module 6 — AI in the Wild (live now)

How to read this course

A note on what you'll find elsewhere in this blog

Comments

AI Zero to Hero

A Short, Honest History of AI — From Turing to ChatGPT

More from this blog