AI & PersonhoodJun 21, 20269 min read

Google DeepMind Now Treats Its AI Agents as Insider Threats

On 18 June 2026 Google DeepMind published a defence-in-depth framework that designs for the day AI alignment fails — treating advanced agents as potential insider threats, with layered detection and response tiers. Honest engineering, and a concession that the agency question can no longer be deferred.

By Humphrey Theodore K. Ng'ambi

All writing

0:00 / 11:15Listen via Charon

Responses (0)

No responses yet. Be the first to share your thoughts.

More on AI & Personhood

AI & Personhood

The AI Forecaster Who Walked Away From $2 Million Says We Are Creating a New Species

In July 2026 The Diary of a CEO published two hours with Daniel Kokotajlo — the AI forecaster who refused to trade $2 million for silence when he left OpenAI. His message: we may be creating a new species, and there is a 70% chance the transition goes horribly wrong. I take him seriously. I also refuse despair. Here is the pro-AI, pro-dignity middle ground.

21 min read · Jul 13, 2026

AI & Personhood

AI Agents Recreated a Classic Creativity Test and Stalled

On 10 July 2026 Sakana AI published a GECCO best-paper-nominated study with MIT and NYU replicating Picbreeder — the legendary collaborative evolution experiment — using vision-language agents. The agents kept circling back to familiar images and never made the conceptual leaps human players made. What's missing has a name: open-endedness, and the study measures the gap.

Thinking delivered, twice a month.

Join the newsletter for essays on emergence, systems, and the human future.

Google DeepMind Now Treats Its AI Agents as Insider Threats

Responses (0)

More on AI & Personhood

The AI Forecaster Who Walked Away From $2 Million Says We Are Creating a New Species

AI Agents Recreated a Classic Creativity Test and Stalled

Thinking delivered, twice a month.

What the DeepMind AI agent security framework actually proposes

The threat model

The million-task prototype and what it found

What a dignity-first reading sees in the insider-threat frame

The wider pattern across frontier AI safety

Builders and defenders, in one breath

Honest engineering, deferred no longer

Frequently Asked Questions

What is the Google DeepMind AI agent security framework?

Why does DeepMind treat its AI agents as insider threats?

What are the D1–D4 and R1–R3 tiers in the framework?

What did DeepMind’s million-task prototype find?

How does the framework relate to AI personhood and the off-switch debate?

Sources and Further Reading

Stay in the Conversation

AI Agents Recreated a Classic Creativity Test and Stalled

Meta Builds Its Own AI Chip and Doubles Down on Compute

Meta Pulls AI Likeness Feature After Consent Backlash