How to read this book
This handbook covers everything in the AI Fundamentals course — all seven modules and 32 lessons — reorganised into four sections and ten chapters you can read like a normal book. No prior knowledge assumed, no maths required, and absolutely no breathless hype. Each chapter ends with a short practical exercise, because confidence comes from doing, not just reading.
It's a living book: the online version is kept current as models and tools change, and the chapters covering specific tools carry a note showing when their facts were last checked. If your PDF is getting on a bit, pop back for a fresh one.
Contents
What's Inside
2. How LLMs Work: Prediction, Tokens & Temperature
3. Does It Actually Understand?
5. Hallucinations, Bias & the Limits of Pattern Matching
7. Choosing Your Tools: Free vs Paid
9. AI Literacy: Claims, Hype & Benchmarks
10. Your Starting Point: First Prompts & the 30-Day Challenge
What AI Actually Is
Chapter One
Machine Learning, Neural Networks & the Family Tree
Let's clear something up before we go anywhere: AI is not magic, it is not conscious, and it is not plotting anything. It's pattern recognition at a scale that happens to look like magic from a distance. This chapter gives you the foundations — and I promise there will be no maths.
How machines learn
Machine learning follows five stages, and once you've seen them you'll recognise them everywhere. First, gather data — thousands of examples of the thing you care about. Second, label it — this email is spam, this one isn't. Third, train — the system hunts for patterns that connect the inputs to the labels. Fourth, test — show it examples it's never seen and check whether the patterns hold. Fifth, deploy and refine — let it loose, and keep correcting it.
Notice who's doing the heavy lifting at the start: humans, labelling things. The model is only ever as good as the data it learns from — bad labels produce bad predictions, which is why data quality is the single most important factor in any machine learning project, and why human oversight never stops mattering.
Neural networks, sans mathematics
A neural network is layers of simple units passing signals to each other. Each connection has a weight — a number controlling how much influence one node has on the next. And here's the entire secret of AI training, the bit the documentaries make mystical: learning is just adjusting those weights until the network gets the right answers. That's it. A spam filter might have thousands of input features and millions of connections, but the principle is the same one you'd see in a network of six nodes.
The family tree: AI, ML and Deep Learning
The terms get used interchangeably at dinner parties, but they're nested, like Russian dolls. AI is the broadest: any system doing something that looks intelligent — including a chess engine running hard-coded rules. Machine learning is the subset that learns from data rather than following written rules. Deep learning is the subset of that which uses many-layered neural networks — the technology behind face recognition and, yes, ChatGPT. All deep learning is machine learning, all machine learning is AI, and none of the arrows run backwards.
The model zoo
One more piece of furniture for the mental model: there are several types of AI model, each solving a fundamentally different problem. Language models generate and understand text. Vision models interpret images. Classification models sort things into buckets. Prediction models forecast what happens next. Generative models create new content. Modern systems often combine several — a self-driving car uses vision models to see, prediction models to anticipate traffic, and classification models to identify objects, all working together.
Key insight
Machine learning isn't magic — it's pattern recognition at scale, and learning is just weight adjustment. The model is only as good as its data, which is why human judgement remains a permanent part of the system.
Put the kettle on and try this
Play the Teach the Model exercise — you label 20 items and watch accuracy climb in real time. Then explain "all deep learning is machine learning, but not the reverse" to someone in your house. If you can land it before the tea goes cold, Chapter One has done its job.
Chapter Two
How LLMs Work: Prediction, Tokens & Temperature
Now we get to the technology behind ChatGPT, Claude and Gemini: the Large Language Model. If you remember one sentence from this entire book, make it this one: an LLM is fundamentally a next-token predictor. Everything else is detail — but the details are rather wonderful, so let's have them.
The prediction game
When you send a prompt, the model doesn't "understand" your request the way a human would. It calculates probabilities for every possible next word in its vocabulary, picks one, appends it, and repeats — building the response one piece at a time, each choice narrowing the space of what can plausibly come next. That's why responses stream in word by word: you're literally watching the prediction loop run.
Tokens: the currency of AI
LLMs don't read words — they read tokens: chunks of words, subwords or characters. This matters practically. A "128K context window" doesn't mean 128,000 words; it means 128,000 tokens, roughly 96,000 English words. Code and non-Latin scripts produce far more tokens per word, so the same window holds less of them. Tokens determine both what fits in a conversation and what API usage costs — they are the currency of the whole business.
Training data: the diet
What an LLM "knows" comes from its training data — vast quantities of web text, books, code and more. Every strength, every gap, and every bias traces back to that diet. Understanding this single fact lets you anticipate where models will excel (anything well-represented in human writing) and where they'll struggle (anything rare, recent, or systematically missing from the record). We'll meet the consequences properly in Chapter Five.
Temperature: the randomness dial
Temperature doesn't make a model smarter or dumber — it controls randomness. At low temperature the model picks the most probable next token every time: focused, predictable, occasionally dull. At high temperature it samples from the longer tail: creative, surprising, occasionally unhinged. Factual work wants low; brainstorming wants high.
The pipeline, end to end
Put it together and your prompt's journey looks like this: raw text → tokenisation → embeddings (each token becomes a position in "meaning space") → attention (working out which words relate to which) → next-token prediction → output, looped until done. Here's why this matters to you: every stage amplifies what came before it. A clear, specific prompt produces cleaner tokens, sharper attention, and higher-confidence predictions. Quality in, quality out — the entire craft of prompting rests on this pipeline.
Key insight
LLMs predict plausible text; they don't retrieve verified facts. Once that sentence sits comfortably in your head, both their brilliance and their blunders stop being mysterious.
Put the kettle on and try this
Play the Next-Word Prediction Game to feel the loop from the inside, then paste a paragraph of your own writing into the Live Tokeniser and see how it gets chopped up. Two small games, and the most important machine of the decade stops being a black box.
Chapter Three
Does It Actually Understand? The Stochastic Parrot Debate
Here's a question worth a chapter of its own, because how you answer it shapes how wisely you use these tools: do large language models actually understand language, or are they — in the famous phrase — "stochastic parrots", very sophisticated pattern matchers repeating plausible noises?
The case for the parrot
The sceptics' argument runs like this: LLMs are trained only to predict the next token. They have no body, no experience of the world, no stakes in anything. When one writes movingly about grief, it isn't drawing on loss — it's drawing on the statistical shadow of every grief-stricken paragraph humans have written. The fluency is real; the understanding behind it, the sceptics say, is borrowed entirely from us.
The case against
The other camp replies: look at what these systems actually do. They solve novel problems that aren't in any training set, translate between concepts, and show internal structures that look suspiciously like representations of meaning. If something walks like understanding and quacks like understanding across millions of unpredictable conversations, at what point does the distinction stop mattering? And — the uncomfortable jab — isn't human understanding also pattern matching, run on biological hardware?
Where the illusion breaks
Happily, you don't need to settle a philosophy debate to use AI well — you need to know where the illusion holds and where it breaks. The practical evidence from the course's challenges: AI succeeds when a problem maps to patterns in its training data, and fails when genuine causal reasoning, novel logic, or real-world experience is required. Change a classic puzzle's wording slightly so the memorised answer no longer applies, and watch the confident machinery sail straight into the wall. That boundary — pattern recall versus genuine reasoning — is the single most useful thing to carry out of this chapter.
Key insight
You needn't decide whether AI "really" understands. You need to know that it succeeds on patterns and fails on novelty — and to verify accordingly. Pragmatism beats philosophy at the keyboard.
Put the kettle on and try this
Work through the Debate Position Explorer and vote on the eight claims — find out which camp you're actually in. Then try the break-the-pattern challenges and watch the illusion wobble in real time.
Strengths & Limitations
Chapter Four
What AI Excels At — and Where It Reliably Fails
The most important skill in working with AI isn't prompting, and it isn't tool choice. It's knowing the boundary: which tasks AI handles brilliantly, and which it fluffs with total confidence. Learn the boundary and you'll delegate like a seasoned manager; miss it and you'll either trust too much or — just as wasteful — too little.
The strengths
AI excels at tasks with clear patterns and abundant training data. Top of the list: writing and editing (drafts, summaries, tone shifts), summarisation of long material, translation, code generation, brainstorming (it never runs dry and never gets embarrassed), and explanation — it can pitch anything at any level, endlessly patiently. The common thread: each of these has millions of worked examples in the training data.
The reliable failures
And then there are the blind spots — not occasional glitches but reliable categories of failure that occur across every major model:
- Current events — training data has a cutoff; without live search, the model is answering from the past.
- Precise arithmetic and counting — it predicts text about numbers rather than calculating them.
- Citations and sources — it will invent plausible-looking references with complete composure (more in Chapter Five).
- Niche and local knowledge — the thinner the training data, the more confident the improvisation.
- Genuine novelty — problems requiring reasoning it hasn't seen patterns for, as we saw in Chapter Three.
- Knowing its own limits — it cannot reliably tell you when it doesn't know something. That job is yours.
Here's the liberating reframe: most failures trace back to a handful of root causes — missing data, no real calculation, no live knowledge, no self-awareness. Recognise the cause and you can predict the failure before it happens, then route around it: verify the citation, do the maths yourself, add a search step, or keep the human in charge.
Key insight
Knowing where the boundary sits — pattern-rich tasks on one side, novelty, precision and currency on the other — is the single most important skill for working with AI effectively.
Put the kettle on and try this
Rate the 15 real-world tasks in the Capability Explorer and see how your instincts score against the actual assessments. Then browse the Failure Gallery — ten categories of reliable failure, each with the workaround.
Chapter Five
Hallucinations, Bias & the Limits of Pattern Matching
This is the chapter that earns the book its keep. The three limitations here aren't bugs awaiting a patch — they're fundamental properties of how these systems work. Understanding them is the difference between an AI user and an AI liability.
Hallucinations: confident fabrication
An AI model will sometimes state things that are completely made up — names, dates, studies, court cases, API functions — with exactly the same confident tone it uses for true things. Why? Because of Chapter Two's golden sentence: it predicts probable next tokens; it doesn't retrieve verified facts. A plausible-sounding citation is, statistically, a very probable sequence of tokens. Whether the paper exists was never part of the calculation.
The defence is a verification habit, scaled to the stakes. Five strategies that work: check claims against original sources; ask the model itself what it's least confident about; cross-check important answers with a second model or a search engine; be especially suspicious of specifics (numbers, names, references — the hallucination hotspots); and for anything consequential, keep a human review as the final gate. The course has a printable five-step checklist worth keeping by your desk.
Bias: the mirror problem
AI outputs reflect the biases of their training data — which is to say, the biases of us, at scale, plus the assumptions baked into model design. It shows up in who gets pictured as a "CEO", whose perspective frames an answer, which defaults are assumed. The crucial point: it isn't malice; it's arithmetic on skewed data. Your role as the user is to be the critical filter — ask whose perspective is represented, what data was likely included or excluded, and whether the output would survive scrutiny from a different viewpoint.
Pattern matching's outer wall
And looming behind both: the limit we met in Chapter Three. AI looks like it reasons, but it's matching patterns, and where genuine causal understanding or novel logic is required, the illusion breaks — confidently. The practical takeaway bears repeating because it's the heart of safe use: always verify AI output on tasks that require true understanding, not just pattern recall.
Key insight
Hallucinations and bias are fundamental properties, not passing bugs. The fix doesn't live in the model — it lives in your habits: verify in proportion to the stakes, and stay the critical filter.
Put the kettle on and try this
Play a few rounds of the Hallucination Spotter — clicking the fabricated sentences sharpens your eye remarkably quickly. Then run the bias detection quiz and see how many of the six bias types you can name afterwards.
The AI Landscape
Chapter Six · Tool facts checked June 2026
The Big Three & Open Source
Armed with a working mental model, let's survey the landscape. The headline, as ever: there's no "best" — there's best for your task, and the differences are practical rather than mystical.
ChatGPT, Claude and Gemini
| Assistant | Choose when you need | Watch out for |
|---|---|---|
| ChatGPT (OpenAI) | One tool for everything; image generation; the largest plugin ecosystem; code with an execution sandbox | Can be verbose and formulaic; confident hallucinations |
| Claude (Anthropic) | Long, nuanced writing; deep document analysis; accuracy over speed | No image generation; smaller ecosystem |
| Gemini (Google) | Real-time web information; video and audio analysis; Google Workspace integration; massive context window | Less distinctive prose style |
The professionals' habit from the course is worth adopting on day one: when an answer matters, run the same prompt through two of them and compare. Thirty seconds, and routinely illuminating.
The open-source world
Alongside the big commercial models runs a thriving open-source ecosystem — models whose weights you can download, run on your own hardware, and fine-tune on your own data. Two myths to retire immediately: open-source does not mean lower quality (many open models match or beat closed ones on specific tasks, especially when fine-tuned), and it isn't only for programmers (though it helps).
When does open-source genuinely win? When data privacy is critical and nothing may leave your infrastructure; when you need deep customisation via fine-tuning on domain data; when cost at scale matters more than convenience; and when you can't accept a vendor changing the model underneath you. The trade-offs run the other way on convenience, support and cutting-edge capability — which is why most individuals start (and happily stay) with the commercial assistants.
Key insight
Think toolbox, not horse race: the big three differ in genuinely useful ways, and open-source is a serious option whenever privacy, customisation or cost-at-scale top your list.
Put the kettle on and try this
Take one question from your real life and ask all three assistants (free tiers are fine). Note the differences in tone, depth and accuracy. Then try the open vs closed trade-off sliders and see how your instincts compare with the expert placement.
Chapter Seven · Tool facts checked June 2026
Choosing Your Tools: Free vs Paid
A blissfully practical chapter: what should you actually sign up for, and what should you pay? The short answer — less than you fear, and probably one subscription rather than three.
What the tiers actually buy
Every major platform offers a capable free tier — genuinely capable, not a crippled demo. Free gets you the standard models, with usage caps and slower access at peak times. The ~$20/month tier is the sweet spot for most professionals: the latest models, far higher limits, and the workhorse features — file uploads, web browsing, image generation. The premium tiers ($200+/month) and API pricing only make sense for power users and people building software on top of AI.
A sensible buying sequence
- Start free, everywhere. Open accounts on the big three, kick the tyres for a fortnight, and notice which one you keep returning to.
- Upgrade one. When you hit the free tier's limits — and if you're using AI properly, you will — pay for the one that fits your work. Not all three; one.
- Reassess quarterly. The landscape shifts fast (hence this book's living edition). A subscription that made sense in January may deserve a rethink by June.
And the value question answers itself with simple arithmetic: if a $20 subscription saves you two hours a month, it's paying for itself several times over at almost any professional wage. The course's ROI calculator will put your own numbers on it.
Matching tool to person
The honest tie-breakers, if the comparison table left you level: choose ChatGPT if you want one versatile tool with every feature bolted on; Claude if your work is mostly serious writing and long documents; Gemini if your life runs on Google Workspace or you lean heavily on current information. And remember the course's best line on the subject: the best AI tool is the one you'll actually use consistently. Start with one, learn it well, then expand — most professionals find two or three tools cover 95% of their needs.
Key insight
Free tiers are for exploring; $20/month is the professional sweet spot; premium is for power users. Pick one tool, learn it deeply, and let real usage — not marketing — drive any expansion.
Put the kettle on and try this
Run the 12-question needs assessment for a personalised recommendation, then the ROI calculator with your actual working patterns. If the numbers say upgrade, upgrade — and diarise a quarterly review.
Safety, Scepticism & Getting Started
Chapter Eight
Responsible Use: Privacy, Copyright & Governance
Using AI well means using it responsibly — not as a worthy afterthought, but because trust is the asset everything else depends on. This chapter covers the five practical territories: ethics, privacy, copyright, the environment, and governance.
Everyday ethics
Most AI ethics isn't trolley problems; it's workplace judgement calls. Is it appropriate to draft a colleague's performance review with AI? To summarise a confidential meeting? To generate a job application's cover letter? The course's ten scenarios sharpen exactly this judgement, and the pattern in the expert answers is consistent: transparency and stakes decide. Low-stakes drafting with your own review? Fine. Anything where the recipient would feel deceived, or where an undisclosed error could harm someone? Stop and rethink.
Privacy: the paste test
Every time you paste text into an AI tool, you're potentially sharing data with a third party. So classify before you paste. Safe: public information, your own non-sensitive drafts. Caution: internal business information — anonymise it, and check your organisation's policy and the platform's data settings. Never: client confidential data, personal data about others, credentials, anything regulated. The habit takes three seconds: "Would I be comfortable if this turned up somewhere unexpected?" If not, redact or refrain.
Copyright: the grey area
AI-generated content sits in a genuinely evolving legal area that varies by jurisdiction and use case. Sensible practice while the courts catch up: treat AI output as raw material rather than finished goods, add meaningful human authorship to anything you publish, don't prompt for imitations of living artists' styles for commercial work, and when real money or reputation rides on it, ask an actual lawyer. (I'm an educator, not your counsel — that sentence is doing legal work, mind.)
Environment: the honest footprint
AI has a measurable energy cost, and pretending otherwise helps no one. Keep perspective — an individual's chatbot use is modest beside, say, flying — but the habits are easy wins anyway: don't run heavyweight models on featherweight tasks, batch your work rather than meandering, and let one well-crafted prompt replace five sloppy ones. Efficiency and quality point the same way, pleasingly.
Governance: rules before incidents
Whether you're a freelancer or an enterprise, write your AI rules down before an incident, not after. A useful policy fits on a page: approved tools, prohibited data, where human review is mandatory, how AI use is disclosed, and who to ask when unsure. The course's policy generator drafts one from five questions — adapt it with your team rather than imposing it, because the conversation is half the value.
Key insight
Responsible use comes down to three habits: classify before you paste, disclose where it matters, and write the rules down before you need them. None of it is hard — all of it is easier than rebuilding trust.
Put the kettle on and try this
Take the ten-scenario ethics quiz and discover your AI Ethics Profile, then run the Policy Generator — even as a freelancer, a five-line personal policy beats good intentions every time.
Chapter Nine
AI Literacy: Claims, Hype & Benchmarks
Every week brings a fresh batch of AI headlines: revolutionary this, human-level that, jobs over by Tuesday. This chapter equips you with the working scepticism to read them all without either panicking or scoffing — the AI literacy that outlasts any particular model.
The six-criteria framework
Run any AI claim through six filters: evidence quality (data or anecdote?), specificity (numbers or adjectives?), reproducibility (can anyone else get this result?), source credibility (who's saying it?), conflict of interest (are they selling something?), and peer review (has anyone qualified checked?). Most impressive-sounding claims collapse against just two or three of these. A claim that's low on specificity and lacks review is almost always marketing, not science.
The partial truth problem
The most dangerous hype isn't fiction — it's a genuine advance wrapped in exaggerated framing. A real 20% improvement becomes "revolutionary"; a narrow lab result becomes "human-level". The kernel of truth makes the exaggeration harder to spot and easier to share. The red flags cluster reliably: vague metrics, cherry-picked benchmarks, missing baselines, demos that are really adverts. Spot one flag and actively hunt for the others — they travel in packs.
Reading benchmarks like a grown-up
Every model launch arrives draped in benchmark scores. Four truths keep them in perspective. A higher score means better at that specific benchmark — a model 2% better on one test may be 5% worse on the one resembling your work. Scores depend heavily on evaluation methodology. Standardised tests measure narrow skills — passing a bar exam tests legal recall, not judgement, empathy or strategy. And some benchmarks have leaked into training data, making scores flattering rather than informative. The question that cuts through all of it: "How does it perform on tasks like mine?" — which only your own testing answers.
A news-reading ritual
For any AI article that excites or alarms you: read past the headline (the gap between headline and research is often enormous); find the primary source; apply the six criteria; ask who benefits from you believing it; and only then decide whether to share, act, or worry. Healthy scepticism, not cynicism — most AI news is somewhere between the hype and the dismissal, and your job is simply to locate it.
Key insight
AI literacy is a transferable skill that outlives every model release: six criteria for claims, an eye for the partial truth, and a firm grip on what benchmarks do and don't measure.
Put the kettle on and try this
Play the Hype-o-Meter — fifteen real headlines, rated hype to accurate — and get your hype-detection score. Then take the next AI headline that crosses your feed and run it through the six criteria before you let it move you.
Chapter Ten · Tool facts checked June 2026
Your Starting Point: First Prompts & the 30-Day Challenge
Enough theory — beautifully necessary theory, but enough. This final chapter turns everything into action: accounts, first prompts, and a 30-day plan that converts knowledge into habit.
Setting up properly
Open free accounts on ChatGPT, Claude and Gemini — all three, because comparison is the fastest teacher. During setup, spend the five minutes most people skip: review the privacy settings. Each platform has different defaults for whether your conversations are used for training; configure them deliberately, especially if work content will pass through (you know the rules from Chapter Eight). Start free; upgrade later, once usage tells you which tool deserves it.
Your first prompts
Begin with low-stakes, genuinely useful tasks, and climb the specificity ladder. Ask for an explanation of something you half-understand. Summarise a long article. Draft an email you'd write anyway. Then notice the lesson hiding in your own results: specificity gets better outputs. "Write me an email" gives you mush; adding who it's for, what it must achieve and the tone you want transforms the result. You're discovering prompt engineering from first principles — the Mastering AI Tools course takes it from here.
And read responses critically from day one: AI sounds equally confident when right and wrong (Chapter Five), so spotting hedging language, checking specifics, and asking "what's missing?" are reading skills to practise on your very first conversation.
The 30-day challenge
Confidence comes from practice, not theory. The course's 30-day challenge builds the habit one small task a day: week one, basic interactions — questions, summaries, definitions, a first AI-assisted email; week two, creating — drafts, brainstorms, outlines, style experiments; week three, analysing — documents, data, decisions; week four, integrating — building AI into your actual daily workflows. Ten minutes a day for a month beats a weekend binge by every measure that matters: habits stick, marathons don't.
Where next
Finish the month and you'll have outgrown this book — which was the plan all along. Your onward path depends on appetite: the Mastering AI Tools course (and its companion handbook) for hands-on skills across the full toolbox; the productivity course for workflow automation; or, if you lead a team, the AI-Native Leadership intensive. Wherever you head: keep the scepticism, keep the verification habit, and keep the kettle warm.
Key insight
You now know how AI works, where it breaks, how to choose tools, and how to use them responsibly. The only remaining ingredient is practice — ten minutes a day for thirty days, and "AI user" stops being aspirational and becomes simply true.
Put the kettle on and try this — one last time
Today: set up your three accounts with the setup checklist and try three prompts from the Guided Prompt Lab. Tomorrow: start day one of the 30-Day Challenge. That's it. That's the whole assignment. Off you pop.
You've reached the end — of the book, not the learning
For the interactive versions of everything here — the visualisers, games, quizzes and calculators across all 32 lessons — head to the AI Fundamentals dashboard. Ready for more? The Mastering AI Tools Handbook picks up exactly where this book leaves off. And remember: this is a living book — check back for the latest edition, or grab a fresh PDF whenever the landscape shifts.