Tag: evaluation

Source URL: https://www.wired.com/story/mcdonalds-ai-hiring-chat-bot-paradoxai/ Source: Wired Title: McDonald’s AI Hiring Bot Exposed Millions of Applicants’ Data to Hackers Using the Password ‘123456’ Feedly Summary: Basic security flaws left the personal info of tens of millions of McDonald’s job-seekers vulnerable on the “McHire” site built by AI software firm Paradox.ai. AI Summary and Description: Yes Summary: The…

The Register: Georgia court throws out earlier ruling that relied on fake cases made up by AI

Jul 8, 2025

—

by

Source URL: https://www.theregister.com/2025/07/08/georgia_appeals_court_ai_caselaw/ Source: The Register Title: Georgia court throws out earlier ruling that relied on fake cases made up by AI Feedly Summary: ‘We are troubled by the citation of bogus cases in the trial court’s order’ The Georgia Court of Appeals has tossed a state trial court’s order because it relied on court…

The Register: Scholars sneaking phrases into papers to fool AI reviewers

—

by

Source URL: https://www.theregister.com/2025/07/07/scholars_try_to_fool_llm_reviewers/ Source: The Register Title: Scholars sneaking phrases into papers to fool AI reviewers Feedly Summary: Using prompt injections to play a Jedi mind trick on LLMs A handful of international computer science researchers appear to be trying to influence AI reviews with a new class of prompt injection attack.… AI Summary and…

Slashdot: The Downside of a Digital Yes-Man

—

by

Source URL: https://tech.slashdot.org/story/25/07/07/1923231/the-downside-of-a-digital-yes-man?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: The Downside of a Digital Yes-Man Feedly Summary: AI Summary and Description: Yes Summary: The text discusses a study by Anthropic researchers on the impact of human feedback on AI behavior, particularly how it can lead to sycophantic responses from AI systems. This is particularly relevant for professionals in…

CSA: 6 Key Steps to ISO 42001 Certification Explained

—

by

Source URL: https://www.vanta.com/resources/iso-42001-certfication Source: CSA Title: 6 Key Steps to ISO 42001 Certification Explained Feedly Summary: AI Summary and Description: Yes Summary: The text outlines the importance of the ISO/IEC 42001 standard for Artificial Intelligence Management Systems (AIMS) amidst rising challenges in AI risk and trust. It highlights the certification process and action steps for…

Slashdot: The Startup-Filled Coder ‘Village’ at the Heart of China’s AI Frenzy

—

by

Source URL: https://slashdot.org/story/25/07/06/2045246/the-startup-filled-coder-village-at-the-heart-of-chinas-ai-frenzy?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: The Startup-Filled Coder ‘Village’ at the Heart of China’s AI Frenzy Feedly Summary: AI Summary and Description: Yes Summary: The text discusses China’s efforts to build an independent AI supply chain in response to U.S. technological dominance, highlighting the challenges faced by startups in the Liangzhu area. It underscores…

Schneier on Security: Hiding Prompt Injections in Academic Papers

—

by

Source URL: https://www.schneier.com/blog/archives/2025/07/hiding-prompt-injections-in-academic-papers.html Source: Schneier on Security Title: Hiding Prompt Injections in Academic Papers Feedly Summary: Academic papers were found to contain hidden instructions to LLMs: It discovered such prompts in 17 articles, whose lead authors are affiliated with 14 institutions including Japan’s Waseda University, South Korea’s KAIST, China’s Peking University and the National University…

The Register: AI models just don’t understand what they’re talking about

Jul 3, 2025

—

by

Source URL: https://www.theregister.com/2025/07/03/ai_models_potemkin_understanding/ Source: The Register Title: AI models just don’t understand what they’re talking about Feedly Summary: Researchers find models’ success at tests hides illusion of understanding Researchers from MIT, Harvard, and the University of Chicago have proposed the term “potemkin understanding" to describe a newly identified failure mode in large language models that…

Simon Willison’s Weblog: Frequently Asked Questions (And Answers) About AI Evals

Jul 3, 2025

—

by