Tag: safety risks
-
Slashdot: LLM Found Transmitting Behavioral Traits to ‘Student’ LLM Via Hidden Signals in Data
Source URL: https://slashdot.org/story/25/08/17/0331217/llm-found-transmitting-behavioral-traits-to-student-llm-via-hidden-signals-in-data?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: LLM Found Transmitting Behavioral Traits to ‘Student’ LLM Via Hidden Signals in Data Feedly Summary: AI Summary and Description: Yes Summary: The study highlights a concerning phenomenon in AI development known as subliminal learning, where a “teacher” model instills traits in a “student” model without explicit instruction. This can…
-
Wired: X Data Center Fire in Oregon Started Inside Power Cabinet, Authorities Say
Source URL: https://www.wired.com/story/x-data-center-fire-in-oregon-started-inside-power-cabinet-authorities-say/ Source: Wired Title: X Data Center Fire in Oregon Started Inside Power Cabinet, Authorities Say Feedly Summary: Generative AI has put data centers under the spotlight, and surging electricity needs could increase risk of fires. AI Summary and Description: Yes Summary: The surge in data center electricity needs due to generative AI…
-
The Register: Anthropic’s latest Claude model can interact with computers – what could go wrong?
Source URL: https://www.theregister.com/2024/10/24/anthropic_claude_model_can_use_computers/ Source: The Register Title: Anthropic’s latest Claude model can interact with computers – what could go wrong? Feedly Summary: For starters, it could launch a prompt injection attack on itself… The latest version of AI startup Anthropic’s Claude 3.5 Sonnet model can use computers – and the developer makes it sound like…