Tag: AI behavior
-
The Register: AI that once called itself MechaHitler will now be available to the US government for $0.42
Source URL: https://www.theregister.com/2025/09/25/grokai_servces_us_government/ Source: The Register Title: AI that once called itself MechaHitler will now be available to the US government for $0.42 Feedly Summary: Elon Musk’s AI appears to be more ideological than competitors Despite protest letters, concerns that it’s biased and untrustworthy, model tweaks to appease its billionaire boss, and even a past…
-
OpenAI : Detecting and reducing scheming in AI models
Source URL: https://openai.com/index/detecting-and-reducing-scheming-in-ai-models Source: OpenAI Title: Detecting and reducing scheming in AI models Feedly Summary: Apollo Research and OpenAI developed evaluations for hidden misalignment (“scheming”) and found behaviors consistent with scheming in controlled tests across frontier models. The team shared concrete examples and stress tests of an early method to reduce scheming. AI Summary and…
-
OpenAI : Why language models hallucinate
Source URL: https://openai.com/index/why-language-models-hallucinate Source: OpenAI Title: Why language models hallucinate Feedly Summary: OpenAI’s new research explains why language models hallucinate. The findings show how improved evaluations can enhance AI reliability, honesty, and safety. AI Summary and Description: Yes Summary: The text discusses OpenAI’s research on the phenomenon of hallucination in language models, offering insights into…
-
The Register: ChatGPT hates LA Chargers fans
Source URL: https://www.theregister.com/2025/08/27/chatgpt_has_a_problem_with/ Source: The Register Title: ChatGPT hates LA Chargers fans Feedly Summary: Harvard researchers find model guardrails tailor query responses to user’s inferred politics and other affiliations OpenAI’s ChatGPT appears to be more likely to refuse to respond to questions posed by fans of the Los Angeles Chargers football team than to followers…
-
OpenAI : Collective alignment: public input on our Model Spec
Source URL: https://openai.com/index/collective-alignment-aug-2025-updates Source: OpenAI Title: Collective alignment: public input on our Model Spec Feedly Summary: OpenAI surveyed over 1,000 people worldwide on how AI should behave and compared their views to our Model Spec. Learn how collective alignment is shaping AI defaults to better reflect diverse human values and perspectives. AI Summary and Description:…