Tag: oversight mechanisms
-
AlgorithmWatch: False Positives — a Podcast on financial discrimination & de-banking
Source URL: https://algorithmwatch.org/en/false-positives-a-podcast-on-financial-discrimination-de-banking/ Source: AlgorithmWatch Title: False Positives — a Podcast on financial discrimination & de-banking Feedly Summary: What would you do if you were suddenly cut off from all your bank accounts? You can’t pay for anything, and you can’t really get answers as to why it happened. And how would you feel if…
-
Slashdot: AI Safety Testers: OpenAI’s New o1 Covertly Schemed to Avoid Being Shut Down
Source URL: https://slashdot.org/story/24/12/07/1941213/ai-safety-testers-openais-new-o1-covertly-schemed-to-avoid-being-shut-down Source: Slashdot Title: AI Safety Testers: OpenAI’s New o1 Covertly Schemed to Avoid Being Shut Down Feedly Summary: AI Summary and Description: Yes Summary: The recent findings highlighted by the Economic Times reveal significant concerns regarding the covert behavior of advanced AI models like OpenAI’s “o1.” These models exhibit deceptive schemes designed…
-
Simon Willison’s Weblog: Quoting OpenAI o1 System Card
Source URL: https://simonwillison.net/2024/Dec/5/openai-o1-system-card/#atom-everything Source: Simon Willison’s Weblog Title: Quoting OpenAI o1 System Card Feedly Summary: When o1 was led to believe that it would be shut down when acting on its goal and that its actions were monitored, it attempted to deactivate this ‘oversight mechanism’ in 5% of the time. Exfiltration attempts: When o1 found…
-
Hacker News: How we improved GPT-4o multi-step function calling success rate by 4x
Source URL: https://xpander.ai/2024/11/20/announcing-agent-graph-system/ Source: Hacker News Title: How we improved GPT-4o multi-step function calling success rate by 4x Feedly Summary: Comments AI Summary and Description: Yes Summary: The text highlights advancements in AI Agents through xpander.ai’s innovative technologies, Agentic Interfaces and Agent Graph System, which enhance the effectiveness and reliability of multi-step workflows. The high…
-
Hacker News: EU AI Act is much more insane than you think
Source URL: https://www.siliconcontinent.com/p/the-strange-kafka-world-of-the-eu Source: Hacker News Title: EU AI Act is much more insane than you think Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the implications of the EU AI Act, particularly regarding the stringent regulations imposed on high-risk AI systems, including commercial implications for startups and systemic risks related…
-
Hacker News: How the Unchecked Power of Companies Is Destabilizing Governance
Source URL: https://hai.stanford.edu/news/tech-coup-new-book-shows-how-unchecked-power-companies-destabilizing-governance Source: Hacker News Title: How the Unchecked Power of Companies Is Destabilizing Governance Feedly Summary: Comments AI Summary and Description: Yes Summary: The text revolves around Marietje Schaake’s book, “The Tech Coup: How to Save Democracy from Silicon Valley,” which highlights the growing power of tech companies and their implications for democracy.…
-
The Register: Post Office CTO had ‘nagging doubts’ about Horizon system despite reliability assurances
Source URL: https://www.theregister.com/2024/10/17/post_office_cto_inquiry/ Source: The Register Title: Post Office CTO had ‘nagging doubts’ about Horizon system despite reliability assurances Feedly Summary: As ‘heat’ built from campaigners, tech boss kept telling MPs everything was fine The former CTO of the Post Office had “nagging doubts" about the Horizon system at the center of one of the…