Tag: AI systems

Source URL: https://yro.slashdot.org/story/25/08/28/1643241/anthropic-will-start-training-its-ai-models-on-chat-transcripts?utm_source=rss1.0mainlinkanon&utm_medium=feed
Source: Slashdot
Title: Anthropic Will Start Training Its AI Models on Chat Transcripts
Feedly Summary:
AI Summary and Description: Yes
Summary: Anthropic has announced a new policy regarding the use of user data for training its AI models, which now includes chat transcripts and coding sessions. Users must choose to opt out…

The Register: ChatGPT hates LA Chargers fans

Aug 28, 2025

Source URL: https://www.theregister.com/2025/08/27/chatgpt_has_a_problem_with/
Source: The Register
Title: ChatGPT hates LA Chargers fans
Feedly Summary: Harvard researchers find model guardrails tailor query responses to user’s inferred politics and other affiliations. OpenAI’s ChatGPT appears to be more likely to refuse to respond to questions posed by fans of the Los Angeles Chargers football team than to followers…

OpenAI: Collective alignment: public input on our Model Spec

Source URL: https://openai.com/index/collective-alignment-aug-2025-updates
Source: OpenAI
Title: Collective alignment: public input on our Model Spec
Feedly Summary: OpenAI surveyed over 1,000 people worldwide on how AI should behave and compared their views to our Model Spec. Learn how collective alignment is shaping AI defaults to better reflect diverse human values and perspectives.
AI Summary and Description:…

Simon Willison’s Weblog: Quoting Bruce Schneier

Source URL: https://simonwillison.net/2025/Aug/27/bruce-schneier/#atom-everything
Source: Simon Willison’s Weblog
Title: Quoting Bruce Schneier
Feedly Summary: We simply don’t know to defend against these attacks. We have zero agentic AI systems that are secure against these attacks. Any AI that is working in an adversarial environment—and by this I mean that it may encounter untrusted training data or…

The Register: Uncle Sam throws AI ‘chili cook-off’ to spice up healthcare fraud detection

Source URL: https://www.theregister.com/2025/08/27/medicare_chili_cookoff/
Source: The Register
Title: Uncle Sam throws AI ‘chili cook-off’ to spice up healthcare fraud detection
Feedly Summary: No stew on the stove, but plenty of heat as devs compete to flag suspect Medicare data. Seeking to rein in healthcare fraud, the US Centers for Medicare & Medicaid Services (CMS) is seeking…

OpenAI: OpenAI and Anthropic share findings from a joint safety evaluation

Source URL: https://openai.com/index/openai-anthropic-safety-evaluation
Source: OpenAI
Title: OpenAI and Anthropic share findings from a joint safety evaluation
Feedly Summary: OpenAI and Anthropic share findings from a first-of-its-kind joint safety evaluation, testing each other’s models for misalignment, instruction following, hallucinations, jailbreaking, and more—highlighting progress, challenges, and the value of cross-lab collaboration.
AI Summary and Description: Yes
Summary:…

The Cloudflare Blog: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint

Source URL: https://blog.cloudflare.com/ai-gateway-aug-2025-refresh/
Source: The Cloudflare Blog
Title: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint
Feedly Summary: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint.
AI Summary and Description: Yes…

Schneier on Security: We Are Still Unable to Secure LLMs from Malicious Inputs

Source URL: https://www.schneier.com/blog/archives/2025/08/we-are-still-unable-to-secure-llms-from-malicious-inputs.html
Source: Schneier on Security
Title: We Are Still Unable to Secure LLMs from Malicious Inputs
Feedly Summary: Nice indirect prompt injection attack: Bargury’s attack starts with a poisoned document, which is shared to a potential victim’s Google Drive. (Bargury says a victim could have also uploaded a compromised file to their own…

The Register: Anthropic teases Claude for Chrome: Don’t try this at home

Aug 26, 2025
