Tag: Orb
-
Wired: Psychological Tricks Can Get AI to Break the Rules
Source URL: https://arstechnica.com/science/2025/09/these-psychological-tricks-can-get-llms-to-respond-to-forbidden-prompts/ Source: Wired Title: Psychological Tricks Can Get AI to Break the Rules Feedly Summary: Researchers convinced large language model chatbots to comply with “forbidden” requests using a variety of conversational tactics. AI Summary and Description: Yes Summary: The text discusses researchers’ exploration of conversational tactics used to manipulate large language model (LLM)…
-
The Register: Boffins detail new method to make neural nets forget private and copyrighted info
Source URL: https://www.theregister.com/2025/09/04/boffins_detail_ai_mind_wipe/ Source: The Register Title: Boffins detail new method to make neural nets forget private and copyrighted info Feedly Summary: Because nobody’s going to spend billions to retrain a model built on dubiously legal content Researchers have found promising new ways to have AI models ignore copyrighted content, suggesting it may be possible…
-
Slashdot: LLM Found Transmitting Behavioral Traits to ‘Student’ LLM Via Hidden Signals in Data
Source URL: https://slashdot.org/story/25/08/17/0331217/llm-found-transmitting-behavioral-traits-to-student-llm-via-hidden-signals-in-data?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: LLM Found Transmitting Behavioral Traits to ‘Student’ LLM Via Hidden Signals in Data Feedly Summary: AI Summary and Description: Yes Summary: The study highlights a concerning phenomenon in AI development known as subliminal learning, where a “teacher” model instills traits in a “student” model without explicit instruction. This can…