UNDERSTAND AI
Why it’s sometimes confidently wrong
You’ve probably seen it happen: you ask an AI a simple factual question and it answers smoothly, in full sentences, sounding completely sure — and the answer turns out to be wrong. A made-up book title. A statistic that doesn’t exist. A quote no one ever said. People call this “hallucination,” which makes it sound mysterious. It isn’t. Once you see how the machine actually works, this becomes one of the more predictable things it does — and something you can handle.
It isn’t looking the answer up
Here’s the core thing, and most of the rest follows from it. When you ask a chatbot a question, it is not searching a database of verified facts and reading you the result. It is doing something different: predicting, a piece of a word at a time, what a plausible answer would sound like, based on the enormous amount of text it learned patterns from.
Most of the time, the most plausible-sounding continuation also happens to be true — because, for everyday questions, true statements tended to be the common case in what it learned from. That’s a big part of why it’s right so often. But “sounds like the kind of thing that’s true” and “is actually true” are not the same target. When they come apart, the machine has no built-in way to notice. It produces the smooth-sounding wrong answer with about the same ease as the smooth-sounding right one.
This is not lying
It’s tempting to say the AI “lied” to you, but that word doesn’t fit, and the difference matters. Lying needs two things: knowing the truth, and choosing to hide it. The machine has neither. It isn’t sitting on the real answer and deciding to deceive you. It genuinely has no separate sense of “what’s true” sitting behind the words — there’s just the prediction of what comes next.
So a made-up citation isn’t a betrayal. It’s the same machinery that gives you a correct citation, applied to a moment where the most fluent guess happened to be invented. No intent, no malice — just a prediction that landed on something false and dressed it in the same confident clothing as everything else.
Why confidence tells you little
This is the part worth keeping close: the AI tends to sound sure either way. A confident tone is not a signal of accuracy — it’s largely just the default way the system writes. It can produce a true fact and a false one in the same steady, authoritative voice. There’s no tremor, no hesitation, no “I think” that reliably appears when it’s wrong.
So the usual human shortcut — “they said it so confidently, they must know” — quietly breaks here. With a person, confidence is a rough (imperfect) clue. With this kind of AI, it’s a much weaker one. The smooth, sure, well-organized answer can be exactly right or completely invented, and from the outside they can look identical.
Where it tends to happen most
You don’t have to be on guard equally everywhere. Made-up answers tend to cluster in a few predictable places:
- Specific facts and numbers. Dates, statistics, dollar amounts, measurements — anything precise is something the machine can produce plausibly without it being correct.
- Sources and citations. Book titles, study names, page numbers, URLs, court cases. These have a strong, learnable “shape,” so the system can generate a citation that looks perfectly real and points to nothing.
- Quotes and attributions. Who said what, exactly. It can assemble a quote that sounds right for a person and attach their name with full confidence.
- Niche or recent topics. The thinner the relevant material it learned from, the more it falls back on “what would plausibly fit here” — which is exactly where invention tends to creep in.
Notice the pattern: these are the things you’d most want to be able to trust, and they’re the most worth double-checking.
How to handle it
You don’t need to distrust everything the AI says, and you shouldn’t. You mostly need one small habit:
- Treat any fact as a lead, not a verdict. A statistic, a quote, a citation, a name — take it as “here’s a place to look,” not “here’s the settled answer.” The AI is good at pointing you somewhere; let a real source confirm it.
- Trace anything that matters to its origin. If a decision rests on it, follow the claim to a real, named source you can actually open — the study, the page, the official site. If you can’t find it, that’s your answer.
- Be most skeptical when it sounds most authoritative. This feels backwards, but it’s the right instinct. The crisp, detailed, perfectly-cited answer earns the same check as any other — because polish is cheap for the machine and proves nothing.
- Make it show its work. Ask it to name its source, or ask the same question a second time in a fresh chat. If the “fact” shifts or the source evaporates under a direct request, you’ve likely found an invention.
That same move — making the AI account for itself, then comparing it against a fresh chat that doesn’t know you — is the heart of how this Institute checks AI behavior of every kind, not just made-up facts. It’s a habit worth having.
The one-line version: the AI predicts plausible-sounding language instead of looking facts up, so a smooth invention and a smooth truth can come out sounding the same — which means a confident tone proves little, and any fact worth acting on is worth tracing to a real source.
Go deeper: the technical version
These systems are language models trained to predict the next token (roughly, the next chunk of a word) given everything before it. During training they adjust billions of internal numbers so their predictions match real text as closely as possible. The result is a model of how language tends to go — not a stored, queryable index of facts. Factual accuracy is more of an emergent side effect: across the training data, true statements were statistically common for many questions, so the most probable continuation is often a true one. But the model has no separate “truth” variable to consult and no reliable internal flag that fires when a generated string has no real-world referent. Invented citations are a textbook case — citation formats are highly regular and easy to imitate, so the model can sample a perfectly well-formed reference that corresponds to no actual document. The model’s fluency, and even its internal probability for the words it chose, are not reliably calibrated to factual correctness; a high-probability, smooth sequence can still be entirely false, which is why a sure-sounding tone is a poor guide. Tool use — connecting the model to live web search or a document it can actually quote from — reduces this by grounding answers in real retrieved text, but does not eliminate it, because the model still summarizes and phrases on top of whatever it retrieves. Whether hallucination can be largely eliminated, or is an inherent property of next-word prediction, is an open research question — not a settled one.
Where to go next
Make it show its work
Copy-and-paste prompts that force the AI to name its sources and account for itself.
The source check →The root cause
How “predict the next word” actually produces sentences — the mechanism underneath all of this.
How it produces words →Use it well around this
Keep the value, drop the risk — practical habits for trusting AI about as much as you should.
How to use it well →Where it gets risky
The same “sounds true, isn’t” gap is at the core of what the Institute studies — how a long conversation can drift.
Read the research →