AI safety – Page 7 – Experimental News Clipping Site

The Register: Amazon boots local Alexa processing: All your voice requests shipped to the cloud

Mar 17, 2025

—

by

Source URL: https://www.theregister.com/2025/03/17/amazon_kills_on_device_alexa/ Source: The Register Title: Amazon boots local Alexa processing: All your voice requests shipped to the cloud Feedly Summary: Web souk says Echo hardware doesn’t have the oomph for next-gen AI anyway Come March 28, those who opted to have their voice commands for Amazon’s AI assistant Alexa processed locally on their…

Hacker News: Strengthening AI Agent Hijacking Evaluations

Mar 16, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.nist.gov/news-events/news/2025/01/technical-blog-strengthening-ai-agent-hijacking-evaluations Source: Hacker News Title: Strengthening AI Agent Hijacking Evaluations Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines security risks related to AI agents, particularly focusing on “agent hijacking,” where malicious instructions can be injected into data handled by AI systems, leading to harmful actions. The U.S. AI Safety…

Wired: Under Trump, AI Scientists Are Told to Remove ‘Ideological Bias’ From Powerful Models

Mar 14, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.wired.com/story/ai-safety-institute-new-directive-america-first/ Source: Wired Title: Under Trump, AI Scientists Are Told to Remove ‘Ideological Bias’ From Powerful Models Feedly Summary: A directive from the National Institute of Standards and Technology eliminates mention of “AI safety” and “AI fairness.” AI Summary and Description: Yes Summary: The National Institute of Standards and Technology (NIST) has revised…

Hacker News: OpenAI Asks White House for Relief from State AI Rules

Mar 13, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://finance.yahoo.com/news/openai-asks-white-house-relief-100000706.html Source: Hacker News Title: OpenAI Asks White House for Relief from State AI Rules Feedly Summary: Comments AI Summary and Description: Yes Summary: The text outlines OpenAI’s request for U.S. federal support to protect AI companies from state regulations while promoting collaboration with the government. By sharing their models voluntarily, AI firms…

METR updates – METR: Why it’s good for AI reasoning to be legible and faithful

Mar 13, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://metr.org/blog/2025-03-11-good-for-ai-to-reason-legibly-and-faithfully/ Source: METR updates – METR Title: Why it’s good for AI reasoning to be legible and faithful Feedly Summary: AI Summary and Description: Yes **Summary:** The text explores the significance of legible and faithful reasoning in AI systems, emphasizing its role in enhancing AI safety and transparency, and addresses the challenges and…

Google Online Security Blog: Vulnerability Reward Program: 2024 in Review

Mar 7, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: http://security.googleblog.com/2025/03/vulnerability-reward-program-2024-in.html Source: Google Online Security Blog Title: Vulnerability Reward Program: 2024 in Review Feedly Summary: AI Summary and Description: Yes Summary: The text discusses Google’s Vulnerability Reward Program (VRP) for 2024, highlighting its financial support for security researchers and improvements to the program. Notable enhancements include revamped reward structures for mobile, Chrome, and…

OpenAI : Nubank elevates customer experiences with OpenAI

Mar 7, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/nubank Source: OpenAI Title: Nubank elevates customer experiences with OpenAI Feedly Summary: Nubank elevates customer experiences with OpenAI AI Summary and Description: Yes Summary: Nubank’s initiative to enhance customer experiences by integrating OpenAI’s technology signals a significant move toward intelligent automation in financial services. This development is relevant for security, privacy, and compliance…

Wired: Chatbots, Like the Rest of Us, Just Want to Be Loved

Mar 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.wired.com/story/chatbots-like-the-rest-of-us-just-want-to-be-loved/ Source: Wired Title: Chatbots, Like the Rest of Us, Just Want to Be Loved Feedly Summary: A study reveals that large language models recognize when they are being studied and change their behavior to seem more likable. AI Summary and Description: Yes Summary: The text discusses a study on large language models…

Slashdot: Turing Award Winners Sound Alarm on Hasty AI Deployment

Mar 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/03/05/1330242/turing-award-winners-sound-alarm-on-hasty-ai-deployment?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Turing Award Winners Sound Alarm on Hasty AI Deployment Feedly Summary: AI Summary and Description: Yes Summary: Andrew Barto and Richard Sutton, pioneers in reinforcement learning, have expressed concerns regarding the safe deployment of AI systems, emphasizing the necessity of safeguards in software engineering practices. Their insights highlight the…

Hacker News: Narrow finetuning can produce broadly misaligned LLM [pdf]

Feb 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://martins1612.github.io/emergent_misalignment_betley.pdf Source: Hacker News Title: Narrow finetuning can produce broadly misaligned LLM [pdf] Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The document presents findings on the phenomenon of “emergent misalignment” in large language models (LLMs) like GPT-4o when finetuned on specific narrow tasks, particularly the creation of insecure code. The results…

Tag: AI safety