Tag: moderation
-
Hacker News: O3-mini System Card [pdf]
Source URL: https://cdn.openai.com/o3-mini-system-card.pdf Source: Hacker News Title: O3-mini System Card [pdf] Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The OpenAI o3-mini System Card details the advanced capabilities, safety evaluations, and risk classifications of the OpenAI o3-mini model. This document is particularly pertinent for professionals in AI security, as it outlines significant safety measures…
-
The Register: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths
Source URL: https://www.theregister.com/2025/01/24/scale_ai_outlier_sued_over/ Source: The Register Title: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths Feedly Summary: Who guards the guardrail makers? Not the bosses who hire them, it’s alleged Scale AI, which labels training data for machine-learning models, was sued this month, alongside labor platform…
-
The Register: OpenAI’s Operator agent wants to tackle your online chores – just don’t expect it to nail every task
Source URL: https://www.theregister.com/2025/01/23/openai_unveils_operator_agent/ Source: The Register Title: OpenAI’s Operator agent wants to tackle your online chores – just don’t expect it to nail every task Feedly Summary: Hello Operator? Can you give me number nine? Can I see you later? Will you give me back my dime? OpenAI on Thursday launched a human-directed AI agent…
-
The Register: EU demands a peek under the hood of X’s recommendation algorithms
Source URL: https://www.theregister.com/2025/01/17/eu_x_algorithm_changes/ Source: The Register Title: EU demands a peek under the hood of X’s recommendation algorithms Feedly Summary: Commission insists the timing has nothing to do with Musk meddling in German politics ahead of election The European Commission is stepping up its ongoing investigation of Elon Musk’s X with a request to examine…
-
Hacker News: Show HN: Llama 3.3 70B Sparse Autoencoders with API access
Source URL: https://www.goodfire.ai/papers/mapping-latent-spaces-llama/ Source: Hacker News Title: Show HN: Llama 3.3 70B Sparse Autoencoders with API access Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The text discusses innovative advancements made with the Llama 3.3 70B model, particularly the development and release of sparse autoencoders (SAEs) for interpretability and feature steering. These tools enhance…