Tag: model governance

  • Simon Willison’s Weblog: TIL: Running a gpt-oss eval suite against LM Studio on a Mac

    Source URL: https://simonwillison.net/2025/Aug/17/gpt-oss-eval-suite/#atom-everything Source: Simon Willison’s Weblog Title: TIL: Running a gpt-oss eval suite against LM Studio on a Mac Feedly Summary: TIL: Running a gpt-oss eval suite against LM Studio on a Mac The other day I learned that OpenAI published a set of evals as part of their gpt-oss model release, described in…

  • OpenAI : Introducing gpt-oss

    Source URL: https://openai.com/index/introducing-gpt-oss Source: OpenAI Title: Introducing gpt-oss Feedly Summary: We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license, these models outperform similarly sized open models on reasoning tasks, demonstrate strong tool use capabilities, and are optimized for efficient deployment…

  • AWS News Blog: Announcing Amazon Nova customization in Amazon SageMaker AI

    Source URL: https://aws.amazon.com/blogs/aws/announcing-amazon-nova-customization-in-amazon-sagemaker-ai/ Source: AWS News Blog Title: Announcing Amazon Nova customization in Amazon SageMaker AI Feedly Summary: AWS now enables extensive customization of Amazon Nova foundation models through SageMaker AI with techniques including continued pre-training, supervised fine-tuning, direct preference optimization, reinforcement learning from human feedback and model distillation to better address domain-specific requirements across…

  • Slashdot: OpenAI Rolls Out GPT-4.5

    Source URL: https://slashdot.org/story/25/02/27/2022254/openai-rolls-out-gpt-45?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI Rolls Out GPT-4.5 Feedly Summary: AI Summary and Description: Yes Summary: OpenAI’s release of the GPT-4.5 model represents a significant enhancement in AI capabilities, particularly in natural language processing and coding efficiency. This model addresses prior issues with accuracy, aiming to reduce fabricated responses, which holds great relevance…

  • Hacker News: Open-R1: an open reproduction of DeepSeek-R1

    Source URL: https://huggingface.co/blog/open-r1 Source: Hacker News Title: Open-R1: an open reproduction of DeepSeek-R1 Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the release of DeepSeek-R1, a language model that significantly enhances reasoning capabilities through advanced training techniques, including reinforcement learning. The Open-R1 project aims to replicate and build upon DeepSeek-R1’s methodologies…

  • Hacker News: Certain names make ChatGPT grind to a halt, and we know why

    Source URL: https://arstechnica.com/information-technology/2024/12/certain-names-make-chatgpt-grind-to-a-halt-and-we-know-why/ Source: Hacker News Title: Certain names make ChatGPT grind to a halt, and we know why Feedly Summary: Comments AI Summary and Description: Yes Summary: The text discusses the operational nuances of OpenAI’s ChatGPT, particularly how certain names trigger output filtering within the model. This behavior illustrates potential challenges related to AI…