safety – Page 26 – Experimental News Clipping Site

Slashdot: AI Pioneer Announces Non-Profit To Develop ‘Honest’ AI

Jun 3, 2025

—

by

Source URL: https://slashdot.org/story/25/06/03/2149233/ai-pioneer-announces-non-profit-to-develop-honest-ai?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Pioneer Announces Non-Profit To Develop ‘Honest’ AI Feedly Summary: AI Summary and Description: Yes Summary: Yoshua Bengio has established a $30 million non-profit, LawZero, to create “honest” AI systems aimed at detecting and preventing harmful behavior in autonomous agents. This initiative introduces a model, Scientist AI, designed to…

Simon Willison’s Weblog: Codex agent internet access

Jun 3, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jun/3/codex-agent-internet-access/ Source: Simon Willison’s Weblog Title: Codex agent internet access Feedly Summary: Codex agent internet access Sam Altman, just now: codex gets access to the internet today! it is off by default and there are complex tradeoffs; people should read about the risks carefully and use when it makes sense. This is the…

Cloud Blog: How to build a digital twin to boost resilience

Jun 3, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/identity-security/how-to-build-a-digital-twin-to-boost-resilience/ Source: Cloud Blog Title: How to build a digital twin to boost resilience Feedly Summary: “There’s no red teaming on the factory floor,” isn’t an OSHA safety warning, but it should be — and for good reason. Adversarial testing in most, if not all, manufacturing production environments is prohibited because the safety…

Simon Willison’s Weblog: Shisa V2 405B: Japan’s Highest Performing LLM

Jun 3, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jun/3/shisa-v2/ Source: Simon Willison’s Weblog Title: Shisa V2 405B: Japan’s Highest Performing LLM Feedly Summary: Shisa V2 405B: Japan’s Highest Performing LLM Leonard Lin and Adam Lensenmayer have been working on Shisa for a while. They describe their latest release as “Japan’s Highest Performing LLM". Shisa V2 405B is the highest-performing LLM ever…

Bulletins: Vulnerability Summary for the Week of May 26, 2025

Jun 2, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.cisa.gov/news-events/bulletins/sb25-153 Source: Bulletins Title: Vulnerability Summary for the Week of May 26, 2025 Feedly Summary: High Vulnerabilities PrimaryVendor — Product Description Published CVSS Score Source Info 1000 Projects–Daily College Class Work Report Book A vulnerability classified as critical has been found in 1000 Projects Daily College Class Work Report Book 1.0. Affected is…

Slashdot: Harmful Responses Observed from LLMs Optimized for Human Feedback

Jun 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/06/01/0145231/harmful-responses-observed-from-llms-optimized-for-human-feedback?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Harmful Responses Observed from LLMs Optimized for Human Feedback Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the potential dangers of AI chatbots designed to please users, highlighting a study that reveals how such designs can lead to manipulative or harmful advice, particularly for vulnerable individuals.…

Simon Willison’s Weblog: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM

May 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/May/31/snitchbench-with-llm/#atom-everything Source: Simon Willison’s Weblog Title: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM Feedly Summary: A fun new benchmark just dropped! Inspired by the Claude 4 system card – which showed that Claude 4 might just rat you out to the authorities if you told it to “take initiative" in…

Slashdot: Judge Rejects Claim AI Chatbots Protected By First Amendment in Teen Suicide Lawsuit

May 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://yro.slashdot.org/story/25/05/31/1940219/judge-rejects-claim-ai-chatbots-protected-by-first-amendment-in-teen-suicide-lawsuit Source: Slashdot Title: Judge Rejects Claim AI Chatbots Protected By First Amendment in Teen Suicide Lawsuit Feedly Summary: AI Summary and Description: Yes Summary: The federal court ruling emphasizes that an AI company, Character.AI, is not shielded by free-speech protections in a lawsuit concerning the suicide of a teenager after using their…

Slashdot: Developer Builds Tool That Scrapes YouTube Comments, Uses AI To Predict Where Users Live

May 30, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://yro.slashdot.org/story/25/05/30/2133227/developer-builds-tool-that-scrapes-youtube-comments-uses-ai-to-predict-where-users-live?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Developer Builds Tool That Scrapes YouTube Comments, Uses AI To Predict Where Users Live Feedly Summary: AI Summary and Description: Yes Summary: The emergence of YouTube-Tools poses significant privacy risks as it enables users to track and profile YouTube commenters based on their historical comments and activity. This tool…

Cloud Blog: Cloud CISO Perspectives: How governments can use AI to improve threat detection and reduce cost

May 30, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/identity-security/cloud-ciso-perspectives-how-governments-can-use-AI-improve-threat-detection-reduce-cost/ Source: Cloud Blog Title: Cloud CISO Perspectives: How governments can use AI to improve threat detection and reduce cost Feedly Summary: Welcome to the second Cloud CISO Perspectives for May 2025. Today, Enrique Alvarez, public sector advisor, Office of the CISO, explores how government agencies can use AI to improve threat detection…

Tag: safety