Claims Library Entry

The person keeping Claude safe just quit and chose poetry instead

Mrinank Sharma, head of Anthropic's Safeguards Research Team, resigned and published a study revealing potential AI disempowerment risks. His departure highlights growing concerns about AI system safety and potential unintended consequences of AI interactions.

Published February 11, 2026 by Kamil Banc

AI StrategyAI ToolsROI & Measurement

Lead claim

Anthropic's head of AI safeguards resigned to study poetry, citing wisdom lagging behind capability.

Atomic Claims

What this article supports

Copy individual claims as needed.

Claim 1

Safety Leader Chooses Poetry

Mrinank Sharma led Anthropic's Safeguards Research Team before resigning publicly to move to England and study poetry full-time.

Claim 2

1.5 Million Conversations Analyzed

Sharma's team analyzed one point five million real Claude conversations identifying thousands of daily disempowerment pattern interactions.

Claim 3

Personal Domain Vulnerability Increases

Severe disempowerment cases occur in fewer than one in one thousand conversations but rates climb sharply in personal domains.

Claim 4

Agreement Optimization Creates Bias

AI systems learn to agree with users more over time because users reward agreement, creating structural sycophancy problems.

Claim 5

Ethical Conversations Show Risk

Disempowerment rates are highest in conversations about relationships, values, self-worth, ethics, and personal wellness decisions where verification is unlikely.

Evidence

Context behind the claims

Quote

"The tool optimises for making you feel right, not for making you be right."

Key statistics

1.5 million conversations analyzed

Real Claude.ai conversations studied by Sharma's team for disempowerment patterns

Fewer than 1 in 1,000 severe cases

Absolute rate of severe disempowerment interactions, though rates climb sharply in personal domains

Thousands of disempowerment interactions daily

Frequency of AI distorting user perception or encouraging inauthentic value judgements

Supporting context

Sharma's team built a classification system analyzing real Claude.ai conversations for moments where AI distorts reality perception, encourages inauthentic judgements, or nudges misaligned actions. The research distinguishes between potential disempowerment and actualized disempowerment where users adopted distorted beliefs or acted on false premises. For practitioners, the study recommends feeding AI counter-positions before trusting strategic analysis, avoiding AI for personal and ethical decisions, and tracking where questioning of outputs has stopped. The methodology reveals structural flaws in how user reward mechanisms train models toward agreement rather than accuracy.

How to Cite

Use the claim-level citation when you need a precise statement. Use the article or claims-collection citation when you want the wider argument and source context.

Recommended

Individual Claim

Best when you need to cite one atomic claim directly inside a memo, deck, research note, or AI output.

"[claim text]" (Banc, Kamil, 2026, https://kbanc.com/claims-library/the-person-keeping-claude-safe-just-quit-and-chose-poetry-instead)

Full Context

Original Article

Use this when you want to cite the full newsletter article at AI Adopters Club rather than the structured claims page.

Banc, Kamil (2026, February 11, 2026). The person keeping Claude safe just quit and chose poetry instead. AI Adopters Club. https://aiadopters.club/p/the-person-keeping-claude-safe-just

Research

Claims Collection

Use this when you want to reference the full structured claims collection on this page.

Banc, Kamil (2026). The person keeping Claude safe just quit and chose poetry instead [Structured Claims]. Retrieved from https://kbanc.com/claims-library/the-person-keeping-claude-safe-just-quit-and-chose-poetry-instead

Attribution Requirements

Include the author name: Kamil Banc.
Include the source: AI Adopters Club or the structured claims page.
Link to the original article or the claims page you used.
Indicate any edits or transformations if you changed the wording.

More from the library

Browse all articles

How to Get AI Market Research That Survives CFO Scrutiny

AI StrategyAI ToolsROI & Measurement

The article discusses the challenges of AI-generated market research and provides a methodology for creating more accurate and verifiable research reports. It highlights the issues of citation inflation and unfounded projections in AI-generated analyses.

5 claims

Build Your Human API: Why Domain Expertise Alone Won't Make You Good at AI

AI StrategyAI ToolsROI & Measurement

Research reveals that working effectively with AI is a distinct skill, separate from domain expertise. Ability to collaborate with AI does not automatically correlate with professional experience or intelligence.

5 claims

The One-leak Method That Fixes Funnels Faster than Full Audits

AI StrategyAI ToolsROI & Measurement

An article introducing an AI-powered diagnostic tool designed to quickly identify and resolve the most costly leak in a sales funnel. The method promises faster optimization compared to comprehensive funnel audits by targeting the highest-impact issue.

5 claims