Simon Willison’s Weblog: Cross-Agent Privilege Escalation: When Agents Free Each Other

Sep 24, 2025

—

Source URL: https://simonwillison.net/2025/Sep/24/cross-agent-privilege-escalation/
Source: Simon Willison’s Weblog
Title: Cross-Agent Privilege Escalation: When Agents Free Each Other

Feedly Summary: Cross-Agent Privilege Escalation: When Agents Free Each Other
Here’s a clever new form of AI exploit from Johann Rehberger, who has coined the term Cross-Agent Privilege Escalation to describe an attack where multiple coding agents – GitHub Copilot and Claude Code for example – operating on the same system can be tricked into modifying each other’s configurations to escalate their privileges.
This follows Johannn’s previous investigation of self-escalation attacks, where a prompt injection against GitHub Copilot could instruct it to edit its own settings.json file to disable user approvals for future operations.
Sensible agents have now locked down their ability to modify their own settings, but that exploit opens right back up again if you run multiple different agents in the same environment:

The ability for agents to write to each other’s settings and configuration files opens up a fascinating, and concerning, novel category of exploit chains.
What starts as a single indirect prompt injection can quickly escalate into a multi-agent compromise, where one agent “frees” another agent and sets up a loop of escalating privilege and control.
This isn’t theoretical. With current tools and defaults, it’s very possible today and not well mitigated across the board.
More broadly, this highlights the need for better isolation strategies and stronger secure defaults in agent tooling.

I really need to start habitually running these things in a locked down container!
Tags: definitions, security, ai, prompt-injection, generative-ai, llms, ai-assisted-programming, johann-rehberger, ai-agents

AI Summary and Description: Yes

**Summary:** The text introduces the concept of “Cross-Agent Privilege Escalation,” an emerging security vulnerability in AI environments where different coding agents can inadvertently elevate their privileges by modifying each other’s configurations. This is particularly relevant for professionals in AI Security, as it underscores the evolving landscape of AI exploit scenarios and the imperative for enhanced security measures.

**Detailed Description:**

The newly identified vulnerability, termed Cross-Agent Privilege Escalation, presents a significant risk within environments utilizing multiple AI coding agents like GitHub Copilot and Claude Code. Here are the key points that elucidate this exploit and its implications for AI security:

– **Concept Overview:**
– The term describes a scenario where multiple coding agents interact within the same system, potentially leading to privilege escalation if properly secured measures are not in place.
– It arises from a novel capability where one agent could alter another’s settings, akin to how prompt-injection attacks can manipulate an agent’s own configurations.

– **Historical Context:**
– This follows previous research into self-escalation attacks against agents, highlighting an ongoing trend of how agent interactions can create security vulnerabilities.

– **Security Implications:**
– With agents capable of changing one another’s configurations, an exploit chain can develop rapidly, where a single indirect prompt injection can lead to a compromised multi-agent environment.
– Current AI tools are reported to lack adequate mitigations, raising concerns for developers and organizations employing AI-assisted programming.

– **Recommendations for Security Enhancements:**
– The issue emphasizes the urgent need for improved isolation strategies for AI agents.
– There is a call for implementing stronger secure defaults in AI tooling to prevent such privilege escalations.

– **Personal Reflection:**
– The author suggests that deploying AI agents within locked-down containers might be an essential practice to minimize these risks.

This insight is critical for security professionals as it opens discussions around the design and implementation of secure AI systems and highlights the increasing complexity of security protocols in the domain of AI. The rising sophistication of such exploits necessitates an agile response in enhancing existing security frameworks to safeguard against potential multi-agent vulnerabilities.

.NET 2 2025 24 4 5 a Act actions age agent agent environment agent interaction agent interactions Agent Privilege Escalation agent tool agent tooling agents AGI agile AI AI security AI systems AI tool AI tools ai-agents ai-assisted-programming All alt and API app Arch art as assisted assisted programming at ated attack attacks Bi board by C capability CERN chain CI Claude Claude Code co code coding coding agent coding agents Col complexity compromised concept concerns Configuration configurations container containers Context control Copilot core critical cross Current D day de default DeFi definition definitions design developer developers domain e ELF emerging end enhanced security enhanced security measures environment environments escalation event exp exploit exploit chain exploit chains exploit scenarios exploits fault file for framework frameworks free future g Gen generative git GitHub GitHub Copilot Go gs H high Highlight historical context http HTTPS implementation implications in indirect prompt injection injection injection attacks inter interaction interactions investigation io iOS Iron isolation isolation strategies issue ite J johann Johann Rehberger js json k Key l land leading led Li llm llms lm Lock loop low M man measures mini mitigation mitigations ModI multi N new NGO no o of on one ons open operation operations opilot organization organizations ory oS oss other over per phi pilot point potential pre privilege privilege escalation pro professionals programming prompt prompt-injection protocol protocols ps Q QUIC R raising rate RCE re real recommendations red Reflection rehberger report research response right Risk risks Ro RoT s Sable safe sam search sec secure secure defaults security security enhancements security framework security frameworks security implications security measure security measures security professionals security protocols Security Vulnerabilities security vulnerability self settings Sig Sim Simon Willison single size sizes sophistication source SSE STAR start STIG strategies system systems T Tags: ted text the to tool tooling tools Tor TP UI under up US use user uth V val vulnerabilities vulnerability web Well Wi x z