Tag: Generated Content
-
Simon Willison’s Weblog: Quoting Colin Fraser
Source URL: https://simonwillison.net/2025/Jan/4/colin-fraser/#atom-everything
Source: Simon Willison’s Weblog
Title: Quoting Colin Fraser
Feedly Summary: Claude is not a real guy. Claude is a character in the stories that an LLM has been programmed to write. Just to give it a distinct name, let’s call the LLM “the Shoggoth”. When you have a conversation with Claude, what’s…
-
Unit 42: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability
Source URL: https://unit42.paloaltonetworks.com/?p=138017
Source: Unit 42
Title: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability
Feedly Summary: The jailbreak technique “Bad Likert Judge” manipulates LLMs to generate harmful content using Likert scales, exposing safety gaps in LLM guardrails. The post Bad Likert Judge: A Novel Multi-Turn Technique to…