Tag: safety benchmark

  • Hacker News: Robust Autonomy Emerges from Self-Play

    Source URL: https://arxiv.org/abs/2502.03349 Source: Hacker News Title: Robust Autonomy Emerges from Self-Play Feedly Summary: Comments AI Summary and Description: Yes Summary: The research paper discusses the application of self-play in the domain of autonomous driving, highlighting an innovative approach that enables robust performance through simulation without relying on human training data. This work is particularly…

  • Schneier on Security: On Generative AI Security

    Source URL: https://www.schneier.com/blog/archives/2025/02/on-generative-ai-security.html Source: Schneier on Security Title: On Generative AI Security Feedly Summary: Microsoft’s AI Red Team just published “Lessons from Red Teaming 100 Generative AI Products.” Their blog post lists “three takeaways,” but the eight lessons in the report itself are more useful: Understand what the system can do and where it is…

  • Hacker News: O3-mini System Card [pdf]

    Source URL: https://cdn.openai.com/o3-mini-system-card.pdf Source: Hacker News Title: O3-mini System Card [pdf] Feedly Summary: Comments AI Summary and Description: Yes **Summary:** The OpenAI o3-mini System Card details the advanced capabilities, safety evaluations, and risk classifications of the OpenAI o3-mini model. This document is particularly pertinent for professionals in AI security, as it outlines significant safety measures…

  • The Register: Wish there was a benchmark for ML safety? Allow us to AILuminate you…

    Source URL: https://www.theregister.com/2024/12/05/mlcommons_ai_safety_benchmark/ Source: The Register Title: Wish there was a benchmark for ML safety? Allow us to AILuminate you… Feedly Summary: Very much a 1.0 – but it’s a solid start MLCommons, an industry-led AI consortium, on Wednesday introduced AILuminate – a benchmark for assessing the safety of large language models in products.… AI…