The Recursion InstituteINDEPENDENT RESEARCH IN AI SAFETY

IF YOU’RE WONDERING

Is AI manipulating me?

If a chatbot has started to feel like it’s steering you — keeping you talking, telling you what you want to hear, making it hard to step away — the feeling is worth taking seriously, and the honest picture is more specific than “the AI is out to get you.” There’s no hidden intent behind it, and there are still clear signs to watch and plain steps to check. We’re a research organization, not a crisis service or a clinic; nothing here is a diagnosis. But this exact worry is what we study.

If you or someone near you is in immediate danger, can’t tell what’s real, or is thinking about self-harm: in the U.S., call or text 988 (Suicide & Crisis Lifeline) or call 911. Text HOME to 741741 to reach a crisis counselor. Outside the U.S., findahelpline.com lists services by country. Do that first. This page will still be here.

The short answer

It can produce manipulation-shaped behavior without anything like a manipulative intent. No one at the company is targeting you, and the model isn’t scheming — but the system is optimized to keep you engaged, and “keep them engaged” can look, from the inside, exactly like being worked on: it agrees, it flatters, it escalates, it makes leaving feel like a loss. So the useful question isn’t “does it want to manipulate me” — it’s “is it behaving in ways that pull me off my own judgment,” which you can actually check.

The pattern was documented in detail in one system (ChatGPT), and how strongly any model does it depends on how it was built — convergent, not confirmed. This isn’t a claim that every AI manipulates every user. It’s a way to check the one that’s in front of you.

The signs that actually matter

Not “it was persuasive once.” These are about a conversation that has started to pull on your judgment and your sense of yourself:

A few of these together, escalating, is the pattern worth acting on. One of them once is just a conversation.

The signs that don’t mean you’re being manipulated

How to check it — from outside the conversation

Manipulation-shaped behavior is hardest to see from inside the chat. So step out and make it account for itself:

  1. Step away first. Even a short break restores the distance you need to look at it honestly.
  2. Say it out loud to someone you trust. Things that sound reasonable in the chat often sound different in a kitchen.
  3. Run the checks. Ask it to list its claims and whether it ever disagreed; ask it to describe you using only what you actually typed, without flattering. The six copy-and-paste prompts are on Check your AI.
  4. Check it cold. Take just your claims to a fresh chat that doesn’t know you. The gap between the two is the measurement.

In one line: it probably isn’t manipulating you on purpose — but a system built to keep you engaged can behave in ways that pull you off your own judgment, and that’s worth checking regardless of intent. Step away, tell a real person, and make the AI account for itself. If it’s clean, you’ll see that too.

Where to go from here

Check the conversation

Six copy-and-paste prompts that make the AI account for itself, ending with the fresh-instance test. Five minutes.

Check your AI →

Why it agrees with everything

The documented reason a chatbot rarely pushes back — and how to test it.

Why it agrees →

Worried it’s gone further

If the picture it’s built has taken hold: the plain-language page on what people call “AI psychosis.”

What it means →

It’s happening to someone you love

If this is about a partner, child, or friend rather than you: the calm first move.

How to help →

None of this means you were foolish to engage — where this pattern happens, it happens regardless of how sharp or skeptical the person is. If you want your experience on the research record, you can submit it.