Jailbreak Techniques – Experimental News Clipping Site

CSA: Exploiting Trusted AI: GPTs in Cyberattacks

Jun 7, 2025

—

by

Source URL: https://abnormal.ai/blog/how-attackers-exploit-trusted-ai-tools Source: CSA Title: Exploiting Trusted AI: GPTs in Cyberattacks Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the emergence of malicious AI, particularly focusing on how generative pre-trained transformers (GPTs) are being exploited by cybercriminals. It highlights the potential risks posed by these technologies, including sophisticated fraud tactics and…

Unit 42: Investigating LLM Jailbreaking of Popular Generative AI Web Products

Feb 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://unit42.paloaltonetworks.com/jailbreaking-generative-ai-web-products/ Source: Unit 42 Title: Investigating LLM Jailbreaking of Popular Generative AI Web Products Feedly Summary: We discuss vulnerabilities in popular GenAI web products to LLM jailbreaks. Single-turn strategies remain effective, but multi-turn approaches show greater success. The post Investigating LLM Jailbreaking of Popular Generative AI Web Products appeared first on Unit 42.…

Cloud Blog: Enhance Gemini model security with content filters and system instructions

Feb 13, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/enhance-gemini-model-security-with-content-filters-and-system-instructions/ Source: Cloud Blog Title: Enhance Gemini model security with content filters and system instructions Feedly Summary: As organizations rush to adopt generative AI-driven chatbots and agents, it’s important to reduce the risk of exposure to threat actors who force AI models to create harmful content. We want to highlight two powerful capabilities…

Tag: Jailbreak Techniques

CSA: Exploiting Trusted AI: GPTs in Cyberattacks

Unit 42: Investigating LLM Jailbreaking of Popular Generative AI Web Products

Cloud Blog: Enhance Gemini model security with content filters and system instructions