Tag: multi-turn attacks
-
Unit 42: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability
Source URL: https://unit42.paloaltonetworks.com/?p=138017 Source: Unit 42 Title: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability Feedly Summary: The jailbreak technique “Bad Likert Judge" manipulates LLMs to generate harmful content using Likert scales, exposing safety gaps in LLM guardrails. The post Bad Likert Judge: A Novel Multi-Turn Technique to…
-
CSA: How Multi-Turn Attacks Generate Harmful AI Content
Source URL: https://cloudsecurityalliance.org/blog/2024/09/30/how-multi-turn-attacks-generate-harmful-content-from-your-ai-solution Source: CSA Title: How Multi-Turn Attacks Generate Harmful AI Content Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the vulnerabilities of Generative AI chatbots to Multi-Turn Attacks, highlighting how they can be manipulated over multiple interactions to elicit harmful content. It emphasizes the need for improved AI security measures…