AI & PersonhoodJun 17, 202610 min read

OpenAI's Deployment Simulation Is AI Safety Without the People

OpenAI published a pre-release safety method replaying 1.3 million real conversations to predict bad model behaviour. The engineering is impressive — but simulating a deployment is not the same as being accountable to the people deployed upon.

By Humphrey Theodore K. Ng'ambi

All writing

0:00 / 14:59Listen via Charon

Responses (0)

No responses yet. Be the first to share your thoughts.

More on AI & Personhood

AI & Personhood

The AI Forecaster Who Walked Away From $2 Million Says We Are Creating a New Species

In July 2026 The Diary of a CEO published two hours with Daniel Kokotajlo — the AI forecaster who refused to trade $2 million for silence when he left OpenAI. His message: we may be creating a new species, and there is a 70% chance the transition goes horribly wrong. I take him seriously. I also refuse despair. Here is the pro-AI, pro-dignity middle ground.

21 min read · Jul 13, 2026

AI & Personhood

AI Agents Recreated a Classic Creativity Test and Stalled

On 10 July 2026 Sakana AI published a GECCO best-paper-nominated study with MIT and NYU replicating Picbreeder — the legendary collaborative evolution experiment — using vision-language agents. The agents kept circling back to familiar images and never made the conceptual leaps human players made. What's missing has a name: open-endedness, and the study measures the gap.

Thinking delivered, twice a month.

Join the newsletter for essays on emergence, systems, and the human future.

OpenAI's Deployment Simulation Is AI Safety Without the People

Responses (0)

More on AI & Personhood

The AI Forecaster Who Walked Away From $2 Million Says We Are Creating a New Species

AI Agents Recreated a Classic Creativity Test and Stalled

Thinking delivered, twice a month.

How Deployment Simulation works

The calculator-hacking case

What Deployment Simulation reveals about OpenAI's safety posture

Prediction is not accountability

The taxonomy gap

Simulation versus relation

What relational safety would require

The agentic frontier and what comes next

Frequently Asked Questions

What is OpenAI Deployment Simulation?

How many conversations did OpenAI analyse for Deployment Simulation?

What is calculator hacking in AI models?

Does Deployment Simulation make AI safe before release?

How does OpenAI Deployment Simulation compare to Anthropic's AI safety approach?

Sources and Further Reading

Stay in the Conversation

AI Agents Recreated a Classic Creativity Test and Stalled

Meta Builds Its Own AI Chip and Doubles Down on Compute

Meta Pulls AI Likeness Feature After Consent Backlash