Tag: moderation

  • Slashdot: OpenAI Eases Content Restrictions For ChatGPT With New ‘Grown-Up Mode’

    Source URL: https://slashdot.org/story/25/02/14/2156202/openai-eases-content-restrictions-for-chatgpt-with-new-grown-up-mode?utm_source=rss1.0mainlinkanon&utm_medium=feed
    Source: Slashdot
    Title: OpenAI Eases Content Restrictions For ChatGPT With New ‘Grown-Up Mode’
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: The recent update to OpenAI’s “Model Spec” showcases a significant policy change permitting the generation of sensitive content, such as erotica and gore, under specific conditions. This shift raises important implications…

  • The GenAI Bug Bounty Program | 0din.ai: The GenAI Bug Bounty Program

    Source URL: https://0din.ai/blog/odin-secures-the-future-of-ai-shopping
    Source: The GenAI Bug Bounty Program | 0din.ai
    Title: The GenAI Bug Bounty Program
    Feedly Summary:
    AI Summary and Description: Yes
    Summary: This text delves into a critical vulnerability uncovered in Amazon’s AI assistant, Rufus, focusing on how ASCII encoding allowed malicious requests to bypass existing guardrails. It emphasizes the need for…

  • The Register: DeepSeek rated too dodgy down under: Banned from Australian government devices

    Source URL: https://www.theregister.com/2025/02/05/australia_deepseek_ban/
    Source: The Register
    Title: DeepSeek rated too dodgy down under: Banned from Australian government devices
    Feedly Summary: As American big tech companies are lashed for their slow efforts to prevent harms, Australia’s Department of Home Affairs has banned the use of DeepSeek on federal government devices.…
    AI Summary and Description: Yes
    Summary: Australia’s…

  • Hacker News: O3-mini System Card [pdf]

    Source URL: https://cdn.openai.com/o3-mini-system-card.pdf
    Source: Hacker News
    Title: O3-mini System Card [pdf]
    Feedly Summary: Comments
    AI Summary and Description: Yes
    Summary: The OpenAI o3-mini System Card details the advanced capabilities, safety evaluations, and risk classifications of the OpenAI o3-mini model. This document is particularly pertinent for professionals in AI security, as it outlines significant safety measures…

  • The Register: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths

    Source URL: https://www.theregister.com/2025/01/24/scale_ai_outlier_sued_over/
    Source: The Register
    Title: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths
    Feedly Summary: Who guards the guardrail makers? Not the bosses who hire them, it’s alleged. Scale AI, which labels training data for machine-learning models, was sued this month, alongside labor platform…

  • The Register: OpenAI’s Operator agent wants to tackle your online chores – just don’t expect it to nail every task

    Source URL: https://www.theregister.com/2025/01/23/openai_unveils_operator_agent/
    Source: The Register
    Title: OpenAI’s Operator agent wants to tackle your online chores – just don’t expect it to nail every task
    Feedly Summary: Hello Operator? Can you give me number nine? Can I see you later? Will you give me back my dime? OpenAI on Thursday launched a human-directed AI agent…

  • The Register: EU demands a peek under the hood of X’s recommendation algorithms

    Source URL: https://www.theregister.com/2025/01/17/eu_x_algorithm_changes/
    Source: The Register
    Title: EU demands a peek under the hood of X’s recommendation algorithms
    Feedly Summary: Commission insists the timing has nothing to do with Musk meddling in German politics ahead of election. The European Commission is stepping up its ongoing investigation of Elon Musk’s X with a request to examine…

  • Wired: GitHub’s Deepfake Porn Crackdown Still Isn’t Working

    Source URL: https://www.wired.com/story/githubs-deepfake-porn-crackdown-still-isnt-working/
    Source: Wired
    Title: GitHub’s Deepfake Porn Crackdown Still Isn’t Working
    Feedly Summary: Over a dozen programs used by creators of nonconsensual explicit images have evaded detection on the developer platform, WIRED has found.
    AI Summary and Description: Yes
    Summary: The text discusses the proliferation of deepfake technology, specifically its application in creating…