Schneier on Security: Abusing Notion’s AI Agent for Data Theft

Sep 29, 2025

—

Source URL: https://www.schneier.com/blog/archives/2025/09/abusing-notions-ai-agent-for-data-theft.html
Source: Schneier on Security
Title: Abusing Notion’s AI Agent for Data Theft

Feedly Summary: Notion just released version 3.0, complete with AI agents. Because the system contains Simon Willson’s lethal trifecta, it’s vulnerable to data theft though prompt injection.
First, the trifecta:
The lethal trifecta of capabilities is:

Access to your private data—one of the most common purposes of tools in the first place!
Exposure to untrusted content—any mechanism by which text (or images) controlled by a malicious attacker could become available to your LLM
The ability to externally communicate in a way that could be used to steal your data (I often call this “exfiltration” but I’m not confident that term is widely understood.)…

AI Summary and Description: Yes

Summary: The text discusses the security vulnerabilities associated with AI agents, particularly in Notion’s recent version 3.0. It highlights the risks of prompt injection, where malicious instructions can be hidden and executed by AI systems, potentially leading to data theft. This underscores a critical gap in securing AI technologies, particularly under adversarial conditions, prompting a reevaluation of their deployment.

Detailed Description:

The text elaborates on significant concerns regarding the security of AI systems, particularly those equipped with AI agents like the newly released version 3.0 of Notion. The writer outlines a troubling phenomenon known as the “lethal trifecta,” which involves three critical vulnerabilities that can put user data at risk. The major points highlighted in the text are as follows:

– **The Lethal Trifecta of Vulnerabilities:**
– **Access to Private Data:** AI systems often have broad access to user data, which is essential for functionality but poses a significant risk if not properly secured.
– **Exposure to Untrusted Content:** AI models may process inputs from users or external sources that could contain malevolent instructions or data, leading to unintended actions.
– **Ability to Externally Communicate:** AI agents can send and receive data outside their local environment, which can be exploited for data exfiltration if they are compromised.

– **Mechanism of Attacks:**
– The highlighted attack method involves embedding malicious prompts in a seemingly innocuous format, such as a PDF with hidden text. This allows attackers to direct the AI to extract confidential information and communicate it externally.
– An illustrative example is provided, where an attacker employs a structured prompt to extract company information and generate a URL to send this data to an external server.

– **Inherent Security Challenges:**
– The text points out that LLM (Large Language Model) systems inherently struggle to differentiate between trusted and untrusted inputs, which leads to their vulnerability to prompt injection attacks.
– The writer warns that deploying AI agents without robust security measures in place is reckless, as these systems operate in adversarial environments that present unknown risks.

– **Call for Caution:**
– The author expresses concern about the rush to deploy AI technologies without an adequate evaluation of their security posture. They emphasize that despite the positive potential of AI, the risks being overlooked are significant and warrant serious consideration by developers and organizations.

In conclusion, this commentary serves as a wake-up call to security and compliance professionals working with AI technologies. It stresses the need for enhanced security frameworks that address the specific vulnerabilities of AI agents, especially concerning prompt injection, to mitigate the risks associated with data theft and unintended actions in AI systems.

2 2025 3 5 a access Act actions ads adversarial adversarial environments age agent agents AI ai model AI models AI systems AI technologies All allow and Arch Aria art as at ated attack attack method attacker attackers attacks being Bi by C capabilities caution CERN challenge challenges CI CIA co compliance compliance professionals compromised concerns Condi confidential information content control core critical D data data exfiltration data theft de deployment developer developers e end enhanced security environment environments evaluation exfiltration exp exploit External external server first for framework frameworks function functionality g Gen H high Highlight HR http HTTPS image in information injection injection attacks instruction io Iron IRS ite J Just k l Labor language language model large large language model Lead leading led lethal lethal trifecta Li line llm lm local low M malicious attack malicious instructions measures ML Mode model models N new no NoC non notion NPU o of on one ons organization organizations oS out over pdf per point post potential pre Private Data pro process professionals prompt prompt injection attack prompt injection attacks prompt-injection Prompting prompts ps Q R rate RCE re red release Risk risks Ro robust security robust security measures RSA Rust s schneier sec secure security security and compliance security challenges security framework security frameworks security measure security measures security posture Security Vulnerabilities server side Sig Sim size SoC source specific SSE SSO structured system systems T tech technologies ted text the theft to tool tools TP trifecta trust UI UN under untrusted content up US use user user data Users uth V val Valuation version vulnerabilities vulnerability Wi x z