Wired: OpenAI Designed GPT-5 to Be Safer. It Still Outputs Gay Slurs

Aug 13, 2025

—

Source URL: https://www.wired.com/story/openai-gpt5-safety/
Source: Wired
Title: OpenAI Designed GPT-5 to Be Safer. It Still Outputs Gay Slurs

Feedly Summary: The new version of ChatGPT explains why it won’t generate rule-breaking outputs. WIRED’s initial analysis found that some guardrails were easy to circumvent.

AI Summary and Description: Yes

Summary: The text discusses a new version of ChatGPT and its mechanisms for not producing rule-breaking outputs. This information is particularly relevant to professionals in AI security, as it highlights ongoing efforts to enhance the safety and compliance of generative models.

Detailed Description: The provided text centers around enhancements made in the latest version of ChatGPT regarding compliance with predefined rules and safety protocols. This topic has significant implications in the field of AI security, especially in discussions about the effectiveness of guardrails in preventing inappropriate or harmful outputs.

Key points include:

– **Generative AI Improvements**: The text mentions that the new version of ChatGPT has made strides in preventing the generation of outputs that break rules, representing an important advancement in generative AI security.

– **Circumvention Issues**: It references an analysis that indicates there were previously some loopholes in these safety mechanisms, revealing the ongoing challenges in securing generative models against misuse.

– **Importance of Guardrails**: The mention of “guardrails” points to the critical nature of implementing effective safety features in AI systems to ensure they adhere to ethical standards and compliance regulations.

– **Implications for AI Security**: This development serves as a reminder for security professionals to continuously evaluate and enhance AI models to prevent potential rule violations and misuse.

The insights provide a foundation for ongoing discussions about improving AI security protocols and ensuring that technology adheres to appropriate guidelines. The reference to guardrails also opens up considerations around how such mechanisms can be designed and tested for robustness in real-world applications.

5 a advancement AI AI design ai model AI models AI security AI systems All analysis and app Application applications ARM art as at C centers challenge challenges chat ChatGPT CI CIA co Col compliance compliance regulations continuous critical D de DeFi design development e effective effectiveness ethical ethical standards event exp feature features fine for g Gen generation generative Generative AI generative model Generative Models Go GPT Guardrails guidelines H harm high Highlight http HTTPS implications improving in information insights io issue k Key l led Li loop loopholes M made misuse Mode model models N new NGO no o of on ons open openai OPM ory out output Outputs point potential pre pro professionals protocol protocols ps R rate RCE re real real-world applications red Regulation regulations RMF Ro robustness RoT s safe safety safety and compliance safety features safety mechanisms safety protocols sec security security professionals security protocols side Sig source SSE standards SUSE system systems T tech technology ted test text the to Tor TP UI up US use V val version Violations Wi world world application world applications x