Tag: AI systems
-
The Register: ChatGPT hates LA Chargers fans
Source URL: https://www.theregister.com/2025/08/27/chatgpt_has_a_problem_with/ Source: The Register Title: ChatGPT hates LA Chargers fans Feedly Summary: Harvard researchers find model guardrails tailor query responses to user’s inferred politics and other affiliations. OpenAI’s ChatGPT appears to be more likely to refuse to respond to questions posed by fans of the Los Angeles Chargers football team than to followers…
-
OpenAI : Collective alignment: public input on our Model Spec
Source URL: https://openai.com/index/collective-alignment-aug-2025-updates Source: OpenAI Title: Collective alignment: public input on our Model Spec Feedly Summary: OpenAI surveyed over 1,000 people worldwide on how AI should behave and compared their views to our Model Spec. Learn how collective alignment is shaping AI defaults to better reflect diverse human values and perspectives. AI Summary and Description:…
-
The Register: Uncle Sam throws AI ‘chili cook-off’ to spice up healthcare fraud detection
Source URL: https://www.theregister.com/2025/08/27/medicare_chili_cookoff/ Source: The Register Title: Uncle Sam throws AI ‘chili cook-off’ to spice up healthcare fraud detection Feedly Summary: No stew on the stove, but plenty of heat as devs compete to flag suspect Medicare data. Seeking to rein in healthcare fraud, the US Centers for Medicare & Medicaid Services (CMS) is seeking…
-
OpenAI : OpenAI and Anthropic share findings from a joint safety evaluation
Source URL: https://openai.com/index/openai-anthropic-safety-evaluation Source: OpenAI Title: OpenAI and Anthropic share findings from a joint safety evaluation Feedly Summary: OpenAI and Anthropic share findings from a first-of-its-kind joint safety evaluation, testing each other’s models for misalignment, instruction following, hallucinations, jailbreaking, and more—highlighting progress, challenges, and the value of cross-lab collaboration. AI Summary and Description: Yes Summary:…
-
The Cloudflare Blog: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint
Source URL: https://blog.cloudflare.com/ai-gateway-aug-2025-refresh/ Source: The Cloudflare Blog Title: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint Feedly Summary: AI Gateway now gives you access to your favorite AI models, dynamic routing and more — through just one endpoint. AI Summary and Description: Yes…
-
Schneier on Security: We Are Still Unable to Secure LLMs from Malicious Inputs
Source URL: https://www.schneier.com/blog/archives/2025/08/we-are-still-unable-to-secure-llms-from-malicious-inputs.html Source: Schneier on Security Title: We Are Still Unable to Secure LLMs from Malicious Inputs Feedly Summary: Nice indirect prompt injection attack: Bargury’s attack starts with a poisoned document, which is shared to a potential victim’s Google Drive. (Bargury says a victim could have also uploaded a compromised file to their own…
-
The Register: Anthropic teases Claude for Chrome: Don’t try this at home
Source URL: https://www.theregister.com/2025/08/26/anthropic_claude_chrome_warnings/ Source: The Register Title: Anthropic teases Claude for Chrome: Don’t try this at home Feedly Summary: AI am inevitable, AI firm argues. Anthropic is now offering a research preview of Claude for Chrome, a browser extension that enables the firm’s machine learning model to automate web browsing.… AI Summary and Description: Yes…