The Plain-English Explanation
When you type a message into ChatGPT, the model doesn't read individual letters or whole sentences. It breaks your text into tokens — chunks that might be a complete word ("hello"), part of a word ("un" + "believ" + "able"), or a punctuation mark. On average, one token is roughly three-quarters of an English word, so 100 words is about 130 tokens.
Tokens matter because every AI model has a token limit — the maximum number of tokens it can process in a single conversation. This limit covers both your input and the AI's output. When you hit the limit, the model starts "forgetting" earlier parts of the conversation.
Why It Matters
Understanding tokens helps you use AI tools more effectively. Knowing that you're working within a token budget means you can write more concise prompts, understand why long conversations lose context, and make informed choices about which model to use for different tasks (models with larger token limits cost more per query but handle longer documents).
Examples in Practice
- A user wondering why ChatGPT seems to forget instructions given earlier in a long conversation — the earlier tokens have fallen outside the context window.
- A developer choosing between GPT-4 (8K or 128K context) and Claude (200K context) for a task that requires processing a 50-page document.
- A business estimating API costs by calculating that their average customer query generates about 500 tokens of input and 1,000 tokens of output.
Common Misconceptions
Myth: One token equals one word.
Reality: A common word might be one token, but longer or unusual words get split into multiple tokens. "Artificial" might be two tokens: "Artific" + "ial." Punctuation and spaces are also tokens.
Myth: More tokens always means better responses.
Reality: Longer responses aren't necessarily better. Setting appropriate output length constraints often produces more focused, useful responses than letting the model use maximum tokens.
Myth: Token limits only affect long conversations.
Reality: Even in short conversations, tokens matter for API costs, response speed, and the amount of context you can provide. Efficient token usage improves results across all interactions.
Related Terms
Learn Tokens (AI) in Depth
Module 3 of AI Fundamentals explains tokens, context windows, and model parameters — the building blocks you need to understand how LLMs work and how to use them effectively.
Explore AI Fundamentals