Tag: generated
-
Unit 42: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability
Source URL: https://unit42.paloaltonetworks.com/?p=138017
Source: Unit 42
Title: Bad Likert Judge: A Novel Multi-Turn Technique to Jailbreak LLMs by Misusing Their Evaluation Capability
Feedly Summary: The jailbreak technique “Bad Likert Judge” manipulates LLMs to generate harmful content using Likert scales, exposing safety gaps in LLM guardrails. The post Bad Likert Judge: A Novel Multi-Turn Technique to…
-
Hacker News: Letting Language Models Write My Website
Source URL: https://nicholas.carlini.com/writing/2025/llms-write-my-bio.html
Source: Hacker News
Title: Letting Language Models Write My Website
Feedly Summary: Comments
AI Summary and Description: Yes
Summary: The text presents an engaging exploration of the capabilities and limitations of large language models (LLMs) through a creative project where the author generates a new homepage and biography each day using different…