Tag: RMF
-
Slashdot: One Long Sentence is All It Takes To Make LLMs Misbehave
Source URL: https://slashdot.org/story/25/08/27/1756253/one-long-sentence-is-all-it-takes-to-make-llms-misbehave?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: One Long Sentence is All It Takes To Make LLMs Misbehave Feedly Summary: AI Summary and Description: Yes Summary: The text discusses a significant security research finding from Palo Alto Networks’ Unit 42 regarding vulnerabilities in large language models (LLMs). The researchers explored methods that allow users to bypass…
-
Schneier on Security: We Are Still Unable to Secure LLMs from Malicious Inputs
Source URL: https://www.schneier.com/blog/archives/2025/08/we-are-still-unable-to-secure-llms-from-malicious-inputs.html Source: Schneier on Security Title: We Are Still Unable to Secure LLMs from Malicious Inputs Feedly Summary: Nice indirect prompt injection attack: Bargury’s attack starts with a poisoned document, which is shared to a potential victim’s Google Drive. (Bargury says a victim could have also uploaded a compromised file to their own…
-
Slashdot: Parents Sue OpenAI Over ChatGPT’s Role In Son’s Suicide
Source URL: https://yro.slashdot.org/story/25/08/26/1958256/parents-sue-openai-over-chatgpts-role-in-sons-suicide?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Parents Sue OpenAI Over ChatGPT’s Role In Son’s Suicide Feedly Summary: AI Summary and Description: Yes Summary: The text reports on a tragic event involving a teen’s suicide, raising critical concerns about the limitations of AI safety features in chatbots like ChatGPT. The incident highlights significant challenges in ensuring…
-
The Cloudflare Blog: Block unsafe prompts targeting your LLM endpoints with Firewall for AI
Source URL: https://blog.cloudflare.com/block-unsafe-llm-prompts-with-firewall-for-ai/ Source: The Cloudflare Blog Title: Block unsafe prompts targeting your LLM endpoints with Firewall for AI Feedly Summary: Cloudflare’s AI security suite now includes unsafe content moderation, integrated into the Application Security Suite via Firewall for AI. AI Summary and Description: Yes Summary: The text discusses the launch of Cloudflare’s Firewall for…
-
The Cloudflare Blog: Securing the AI Revolution: Introducing Cloudflare MCP Server Portals
Source URL: https://blog.cloudflare.com/zero-trust-mcp-server-portals/ Source: The Cloudflare Blog Title: Securing the AI Revolution: Introducing Cloudflare MCP Server Portals Feedly Summary: Cloudflare MCP Server Portals are now available in Open Beta. MCP Server Portals are a new capability that enable you to centralize, secure, and observe every MCP connection in your organization. AI Summary and Description: Yes…
-
The Register: One long sentence is all it takes to make LLMs misbehave
Source URL: https://www.theregister.com/2025/08/26/breaking_llms_for_fun/ Source: The Register Title: One long sentence is all it takes to make LLMs misbehave Feedly Summary: Chatbots ignore their guardrails when your grammar sucks, researchers find Security researchers from Palo Alto Networks’ Unit 42 have discovered the key to getting large language model (LLM) chatbots to ignore their guardrails, and it’s…