Tag: AI security
-
Embrace The Red: Windsurf: Memory-Persistent Data Exfiltration (SpAIware Exploit)
Source URL: https://embracethered.com/blog/posts/2025/windsurf-spaiware-exploit-persistent-prompt-injection/ Source: Embrace The Red Title: Windsurf: Memory-Persistent Data Exfiltration (SpAIware Exploit) Feedly Summary: In this second post about Windsurf Cascade we are exploring the SpAIware attack, which allows memory persistent data exfiltration. SpAIware is an attack we first successfully demonstrated with ChatGPT last year and OpenAI mitigated. While inspecting the system prompt…
-
The Register: Anthropic scanning Claude chats for queries about DIY nukes for some reason
Source URL: https://www.theregister.com/2025/08/21/anthropic_claude_nuclear_chat_detection/ Source: The Register Title: Anthropic scanning Claude chats for queries about DIY nukes for some reason Feedly Summary: Because savvy terrorists always use public internet services to plan their mischief, right? Anthropic says it has scanned an undisclosed portion of conversations with its Claude AI model to catch concerning inquiries about nuclear…
-
Unit 42: Logit-Gap Steering: A New Frontier in Understanding and Probing LLM Safety
Source URL: https://unit42.paloaltonetworks.com/logit-gap-steering-impact/ Source: Unit 42 Title: Logit-Gap Steering: A New Frontier in Understanding and Probing LLM Safety Feedly Summary: New research from Unit 42 on logit-gap steering reveals how internal alignment measures can be bypassed, making external AI security vital. The post Logit-Gap Steering: A New Frontier in Understanding and Probing LLM Safety appeared…
-
Wired: Do Large Language Models Dream of AI Agents?
Source URL: https://www.wired.com/story/sleeptime-compute-chatbots-memory/ Source: Wired Title: Do Large Language Models Dream of AI Agents? Feedly Summary: For AI models, knowing what to remember might be as important as knowing what to forget. Welcome to the era of “sleeptime compute.” AI Summary and Description: Yes Summary: The text introduces the concept of “sleeptime compute,” which emphasizes…