Tag: moderation
- Hacker News: O3-mini System Card [pdf]
  Source URL: https://cdn.openai.com/o3-mini-system-card.pdf
  Source: Hacker News
  **Summary:** The OpenAI o3-mini System Card details the advanced capabilities, safety evaluations, and risk classifications of the OpenAI o3-mini model. This document is particularly pertinent for professionals in AI security, as it outlines significant safety measures…
- The Register: Mental toll: Scale AI, Outlier sued by humans paid to steer AI away from our darkest depths
  Source URL: https://www.theregister.com/2025/01/24/scale_ai_outlier_sued_over/
  Source: The Register
  Feedly Summary: Who guards the guardrail makers? Not the bosses who hire them, it’s alleged. Scale AI, which labels training data for machine-learning models, was sued this month, alongside labor platform…
- The Register: OpenAI’s Operator agent wants to tackle your online chores – just don’t expect it to nail every task
  Source URL: https://www.theregister.com/2025/01/23/openai_unveils_operator_agent/
  Source: The Register
  Feedly Summary: Hello Operator? Can you give me number nine? Can I see you later? Will you give me back my dime? OpenAI on Thursday launched a human-directed AI agent…
- The Register: EU demands a peek under the hood of X’s recommendation algorithms
  Source URL: https://www.theregister.com/2025/01/17/eu_x_algorithm_changes/
  Source: The Register
  Feedly Summary: Commission insists the timing has nothing to do with Musk meddling in German politics ahead of election. The European Commission is stepping up its ongoing investigation of Elon Musk’s X with a request to examine…
- Hacker News: Show HN: Llama 3.3 70B Sparse Autoencoders with API access
  Source URL: https://www.goodfire.ai/papers/mapping-latent-spaces-llama/
  Source: Hacker News
  **Summary:** The text discusses innovative advancements made with the Llama 3.3 70B model, particularly the development and release of sparse autoencoders (SAEs) for interpretability and feature steering. These tools enhance…