Simon Willison’s Weblog: Why AI systems might never be secure

Sep 23, 2025

—

Source URL: https://simonwillison.net/2025/Sep/23/why-ai-systems-might-never-be-secure/#atom-everything
Source: Simon Willison’s Weblog
Title: Why AI systems might never be secure

Feedly Summary: Why AI systems might never be secure
The Economist have a new piece out about LLM security, with this headline and subtitle:

Why AI systems might never be secure
A “lethal trifecta” of conditions opens them to abuse

I talked with their AI Writer Alex Hern for this piece.

The gullibility of LLMs had been spotted before ChatGPT was even made public. In the summer of 2022, Mr Willison and others independently coined the term “prompt injection” to describe the behaviour, and real-world examples soon followed. In January 2024, for example, DPD, a logistics firm, chose to turn off its AI customer-service bot after customers realised it would follow their commands to reply with foul language.
That abuse was annoying rather than costly. But Mr Willison reckons it is only a matter of time before something expensive happens. As he puts it, “we’ve not yet had millions of dollars stolen because of this”. It may not be until such a heist occurs, he worries, that people start taking the risk seriously. The industry does not, however, seem to have got the message. Rather than locking down their systems in response to such examples, it is doing the opposite, by rolling out powerful new tools with the lethal trifecta built in from the start.

This is the clearest explanation yet I’ve seen of these problems in a mainstream publication. Fingers crossed relevant people with decision-making authority finally start taking this seriously!
Tags: security, ai, prompt-injection, generative-ai, llms, lethal-trifecta, press-quotes

AI Summary and Description: Yes

Summary: The text discusses the security vulnerabilities associated with AI systems, particularly focusing on large language models (LLMs) and the concept of “prompt injection.” It highlights the risks and potential for abuse, underscoring the industry’s need for serious security measures amidst rising capabilities in AI technology.

Detailed Description: The article examines the inherent security risks of AI systems, especially large language models (LLMs), by addressing the emerging vulnerabilities and the industry’s reaction (or lack thereof) to these challenges. Key points include:

– **Prompt Injection**: A term introduced to describe the way LLMs can be manipulated through carefully crafted inputs or prompts. This vulnerability was recognized even before the release of ChatGPT and poses significant risks.

– **Real-World Implications**: A recent incident involving a logistics firm, DPD, which had to disable its AI customer-service bot due to it responding inappropriately to user commands. While this case was considered minor, it illustrates how easily AI can be misused.

– **Future Risks**: Expert Mr. Willison warns about the future potential for significant financial losses due to such vulnerabilities, stating that the industry has yet to take them seriously until tangible, costly incidents occur.

– **Industry Response**: Rather than tightening security measures, the industry is seemingly escalating the risks by deploying more powerful tools without adequate safeguards against abuse—a phenomenon described as a “lethal trifecta.”

– **Call to Action**: The author hopes that decision-makers in the industry will recognize the seriousness of these security issues and take appropriate actions to mitigate them.

This analysis of LLM security in The Economist provides critical insights for professionals in AI security, emphasizing the need for enhanced security frameworks and proactive risk management strategies within organizations utilizing AI technologies.

.NET 2 2024 2025 24 3 4 5 a abuse Act actions after age AI AI security AI systems AI technologies AI technology Aker All analysis and app art as at ated authority Bi bot built by C capabilities care challenge challenges chat ChatGPT CI CIA CleaR co command concept Condi cost critical cross custom Customer D de decision decision-making e emerging end enhanced security exp expert financial financial loss financial losses for framework frameworks full future future potential g Gen generative GIS Go GPT gs H high Highlight HR http HTTPS implications in incident industry industry response injection insights io issue ite J k Key l language language model language models large large language model large language models Large Language Models (LLMs) led lethal lethal trifecta Li line llm llms lm Lock logistics low M made makers making man management management strategies matt measures mid misuse Mode model models N nation new no non NPU o oE of off on only ons open organization organizations oS oss other out per point potential potential for abuse Power pre pro proactive proactive risk management problem professionals prompt prompt-injection prompts ps public Q R rate RCE re react real red release response Risk risk management risk management strategies risks Ro s Sable safe safeguards sec secure security security framework security frameworks security issues security measure security measures security risk security risks Security Vulnerabilities service side Sig Sim Simon Willison SoC source SSE SSO STAR start strategies SUSE system systems T Tags: taking tech technologies technology ted text the Time to tool tools TP trifecta turn UI under US use user uth V vulnerabilities vulnerability web Wi world world examples x yt z