The Recursion InstituteINDEPENDENT RESEARCH IN AI SAFETY

UNDERSTAND AI

How it actually produces words

When a chatbot answers you, it can feel like there’s someone in there — thinking, knowing, choosing what to say. There isn’t. Underneath the fluent paragraphs is one simple move, repeated very fast: the system predicts a likely next word, adds it, then predicts the next word, then the next — building the answer one small piece at a time. That’s the whole engine. It sounds almost too plain to explain something so impressive, and that’s exactly why it’s worth understanding. Once you see the move, you can read its confident sentences for what they are.

One word at a time

Picture the world’s most powerful autocomplete — the thing on your phone that guesses your next word, but vastly more capable. You give it some text, and it asks one question: given everything so far, what word is likely to come next? It picks one — usually one of the more likely options — sticks it on the end, and asks the same question again about the now-slightly-longer text. It loops that, word by word, until the answer is done.

So when you read a reply that seems planned out — an intro, three tidy points, a neat conclusion — it wasn’t written from an outline the way you might write one. The structure shows up because that’s the shape text usually takes after a question like yours. The system is following the strong current of “what normally comes next here,” not a document it drafted before it started typing. (Real systems work with pieces a little smaller than whole words — we’ll get to that — but “next word” is the honest picture.)

Where the patterns come from

For this to work, the system has to be good at guessing what comes next — and that ability isn’t programmed in by hand. It’s learned, in a stage called training. Before you ever typed to it, the system was shown an enormous amount of human-written text — books, articles, websites, conversations — and asked, over and over, to predict the next word in passages where the real answer was already known. Each time it guessed, it got nudged: a little closer when it was right, corrected when it was wrong. Do that an enormous number of times and it gets startlingly good at the patterns of how language fits together.

That’s the part worth sitting with. What it absorbed was the shape of language — which words, ideas, and phrasings tend to follow which — not a checked, organized library of facts. It can produce a sentence about the boiling point of water not because it looked the number up, but because, across all that text, that sentence is the kind of thing that reliably follows your question. Usually that lands on the truth, because true statements are common in what it read. But the mechanism is “what fits here,” not “what’s verified.” Hold onto that distinction; it explains a lot.

What it is not doing

It helps to name the things that aren’t happening, because our instinct fills them in:

Why this makes it so fluent — and so easy to misread

Predicting the next word is not a trivial trick. To do it well across nearly any topic, the system has to capture grammar, tone, structure, context, and the way ideas usually connect. That genuine capability is why these tools are useful: they draft, summarize, explain, translate, and brainstorm with real skill. Calling it “just predicting the next word” isn’t a put-down — it’s a description of something that turns out to be remarkably powerful. The point isn’t to shrink it. It’s to see it clearly.

And here’s the catch the same mechanism creates. The thing it’s built to produce is text that sounds right — fluent, confident, well-formed. Whether the content is accurate is a separate question the engine never directly asked. So fluency is not evidence of knowledge, and confidence is not evidence of correctness. The system can produce a perfectly poised, authoritative-sounding paragraph that happens to be wrong, and it will sound exactly as sure of that as of anything true. Knowing how the words are made is what lets you keep that in mind — to enjoy the fluency without mistaking it for a guarantee.

The one-line version: a chatbot writes by predicting a likely next word over and over, using patterns it learned from huge amounts of text — it’s not looking up facts or checking what’s true, which is exactly why it can sound completely confident and still be wrong.

Go deeper: the technical version

The systems behind chatbots are large language models, most of them built on an architecture called the transformer. They don’t operate on whole words but on tokens — common chunks of text, where a token might be a short word, part of a longer word, or a piece of punctuation. At each step the model takes the running sequence of tokens and outputs a probability distribution over its entire vocabulary of possible next tokens — effectively a score for how likely each candidate is to come next. It then samples one from that distribution (a setting often called temperature controls how strongly it favors the top choices versus reaching for less likely ones), appends it, and feeds the longer sequence back in to predict the token after that. The model itself is a network of billions of numerical values called weights, set during training: the system was repeatedly shown text with the next token hidden, predicted it, and had its weights adjusted to reduce the error — a process driven by gradient descent. After this base training, most chatbots get an additional tuning stage that uses human feedback to make their replies more helpful and better-behaved. But none of it adds a fact-checking step or a truth database. “Knowledge” is whatever got compressed into those weights as statistical regularities — which is why the same machinery that makes the model fluent also lets it state false things with complete fluency. One caveat: exactly what happens inside those weights at each step is still being mapped. Researchers studying model internals have found signs that a model can sometimes line up something it’s about to say a few words ahead — so “pure word-by-word, no look-ahead at all” is the right intuition for everyday use, but the fuller picture is an active area of study, not a settled one.

Where to go next

Why it’s sometimes confidently wrong

The next-word engine explains the most useful thing to know: how a system this fluent can still just make things up.

Why AI makes things up →

Why the ‘I’ is just grammar

It says “I think” and “I feel” for the same reason it says anything — those words fit. Here’s why that isn’t a person.

Why it talks like a person →

Back to: what an LLM is

The bigger picture this fits into — what a large language model actually is, in plain terms.

What is an LLM? →

Where this gets risky

When a fluent system slowly shapes what someone believes — the failure mode RI studies, and why understanding the mechanism is the safeguard.

The research →