Tag: harm

—

by

Source URL: https://slashdot.org/story/25/06/03/2149233/ai-pioneer-announces-non-profit-to-develop-honest-ai?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Pioneer Announces Non-Profit To Develop ‘Honest’ AI Feedly Summary: AI Summary and Description: Yes Summary: Yoshua Bengio has established a $30 million non-profit, LawZero, to create “honest” AI systems aimed at detecting and preventing harmful behavior in autonomous agents. This initiative introduces a model, Scientist AI, designed to…

Simon Willison’s Weblog: Codex agent internet access

—

by

Source URL: https://simonwillison.net/2025/Jun/3/codex-agent-internet-access/ Source: Simon Willison’s Weblog Title: Codex agent internet access Feedly Summary: Codex agent internet access Sam Altman, just now: codex gets access to the internet today! it is off by default and there are complex tradeoffs; people should read about the risks carefully and use when it makes sense. This is the…

Cloud Blog: How Alpian is redefining private banking for the digital age with gen AI

—

by

Source URL: https://cloud.google.com/blog/topics/financial-services/how-alpian-is-redefining-private-banking-for-the-digital-age-with-gen-ai/ Source: Cloud Blog Title: How Alpian is redefining private banking for the digital age with gen AI Feedly Summary: As the first fully cloud-native private bank in Switzerland, Alpian stands at the forefront of digital innovation in the financial services sector. With its unique model blending personal wealth management and digital convenience,…

Slashdot: Jony Ive’s OpenAI Device Gets the Laurene Powell Jobs Nod of Approval

—

by

Source URL: https://apple.slashdot.org/story/25/06/02/2139234/jony-ives-openai-device-gets-the-laurene-powell-jobs-nod-of-approval?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Jony Ive’s OpenAI Device Gets the Laurene Powell Jobs Nod of Approval Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the endorsement of a secretive AI hardware device being developed by Jony Ive and OpenAI, with Laurene Powell Jobs expressing her support and investment in the…

Slashdot: Pro-AI Subreddit Bans ‘Uptick’ of Users Who Suffer From AI Delusions

—

by

Source URL: https://tech.slashdot.org/story/25/06/02/2156253/pro-ai-subreddit-bans-uptick-of-users-who-suffer-from-ai-delusions?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Pro-AI Subreddit Bans ‘Uptick’ of Users Who Suffer From AI Delusions Feedly Summary: AI Summary and Description: Yes Summary: The text highlights a concerning phenomenon where users in a pro-AI Reddit community are being banned for projecting grandiose beliefs about AI, particularly due to the influence of large language…

Unit 42: How Good Are the LLM Guardrails on the Market? A Comparative Study on the Effectiveness of LLM Content Filtering Across Major GenAI Platforms

Jun 2, 2025

—

by

Source URL: https://unit42.paloaltonetworks.com/comparing-llm-guardrails-across-genai-platforms/ Source: Unit 42 Title: How Good Are the LLM Guardrails on the Market? A Comparative Study on the Effectiveness of LLM Content Filtering Across Major GenAI Platforms Feedly Summary: We compare the effectiveness of content filtering guardrails across major GenAI platforms and identify common failure cases across different systems. The post How…

Microsoft Security Blog: Announcing a new strategic collaboration to bring clarity to threat actor naming

Jun 2, 2025

—

by

Source URL: https://www.microsoft.com/en-us/security/blog/2025/06/02/announcing-a-new-strategic-collaboration-to-bring-clarity-to-threat-actor-naming/ Source: Microsoft Security Blog Title: Announcing a new strategic collaboration to bring clarity to threat actor naming Feedly Summary: Microsoft and CrowdStrike are teaming up to create alignment across our individual threat actor taxonomies to help security professionals connect insights faster. The post Announcing a new strategic collaboration to bring clarity to…

Bulletins: Vulnerability Summary for the Week of May 26, 2025

Jun 2, 2025

—

by

Source URL: https://www.cisa.gov/news-events/bulletins/sb25-153 Source: Bulletins Title: Vulnerability Summary for the Week of May 26, 2025 Feedly Summary: High Vulnerabilities PrimaryVendor — Product Description Published CVSS Score Source Info 1000 Projects–Daily College Class Work Report Book A vulnerability classified as critical has been found in 1000 Projects Daily College Class Work Report Book 1.0. Affected is…

Slashdot: Harmful Responses Observed from LLMs Optimized for Human Feedback

Jun 1, 2025

—

by