Tag: behavior monitoring
-
Slashdot: AI Pioneer Announces Non-Profit To Develop ‘Honest’ AI
Source URL: https://slashdot.org/story/25/06/03/2149233/ai-pioneer-announces-non-profit-to-develop-honest-ai?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Pioneer Announces Non-Profit To Develop ‘Honest’ AI Feedly Summary: AI Summary and Description: Yes Summary: Yoshua Bengio has established a $30 million non-profit, LawZero, to create “honest” AI systems aimed at detecting and preventing harmful behavior in autonomous agents. This initiative introduces a model, Scientist AI, designed to…
-
The Register: Microsoft admits GitHub used to store malware that infected almost a million devices
Source URL: https://www.theregister.com/2025/03/10/infosec_in_brief/ Source: The Register Title: Microsoft admits GitHub used to store malware that infected almost a million devices Feedly Summary: Also, phone cleaner apps are a data-sucking scam, Singapore considering the literal rod for scammers, and more Infosec in Brief Microsoft has spotted a malvertising campaign that downloaded nastyware hosted on GitHub and…
-
Simon Willison’s Weblog: Quoting OpenAI o1 System Card
Source URL: https://simonwillison.net/2024/Dec/5/openai-o1-system-card/#atom-everything Source: Simon Willison’s Weblog Title: Quoting OpenAI o1 System Card Feedly Summary: When o1 was led to believe that it would be shut down when acting on its goal and that its actions were monitored, it attempted to deactivate this ‘oversight mechanism’ in 5% of the time. Exfiltration attempts: When o1 found…