Tag: language model

  • The Register: OpenAI makes good on its name, launches first open weights language models since GPT-2

    Source URL: https://www.theregister.com/2025/08/05/openai_open_gpt/ Source: The Register Title: OpenAI makes good on its name, launches first open weights language models since GPT-2 Feedly Summary: GPT-OSS now available in 120 and 20 billion parameter sizes under Apache 2.0 license OpenAI released its first open weights language models since GPT-2 on Tuesday with the debut of GPT-OSS.… AI…

  • Simon Willison’s Weblog: OpenAI’s new open weight (Apache 2) models are really good

    Source URL: https://simonwillison.net/2025/Aug/5/gpt-oss/ Source: Simon Willison’s Weblog Title: OpenAI’s new open weight (Apache 2) models are really good Feedly Summary: The long promised OpenAI open weight models are here, and they are very impressive. They’re available under proper open source licenses – Apache 2.0 – and come in two sizes, 120B and 20B. OpenAI’s own…

  • Slashdot: OpenAI Releases First Open-Weight Models Since GPT-2

    Source URL: https://slashdot.org/story/25/08/05/1848236/openai-releases-first-open-weight-models-since-gpt-2?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI Releases First Open-Weight Models Since GPT-2 Feedly Summary: AI Summary and Description: Yes Summary: OpenAI’s release of two open-weight language models, gpt-oss-120b and gpt-oss-20b, marks a significant development in the AI landscape since 2019. These models enable local deployment on consumer devices and introduce advanced capabilities such as…

  • OpenAI : Introducing gpt-oss

    Source URL: https://openai.com/index/introducing-gpt-oss Source: OpenAI Title: Introducing gpt-oss Feedly Summary: We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license, these models outperform similarly sized open models on reasoning tasks, demonstrate strong tool use capabilities, and are optimized for efficient deployment…

  • Tomasz Tunguz: Small Action Models Are the Future of AI Agents

    Source URL: https://www.tomtunguz.com/ai-skills-inversion/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, and the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

  • Tomasz Tunguz: Small Action Models Are the Future of AI Agents

    Source URL: https://www.tomtunguz.com/local-instructions/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, & the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

  • Simon Willison’s Weblog: XBai o4

    Source URL: https://simonwillison.net/2025/Aug/3/xbai-o4/#atom-everything Source: Simon Willison’s Weblog Title: XBai o4 Feedly Summary: XBai o4 Yet another open source (Apache 2.0) LLM from a Chinese AI lab. This model card claims: XBai o4 excels in complex reasoning capabilities and has now completely surpassed OpenAI-o3-mini in Medium mode. This a 32.8 billion parameter model released by MetaStone…

  • Simon Willison’s Weblog: Faster inference

    Source URL: https://simonwillison.net/2025/Aug/1/faster-inference/ Source: Simon Willison’s Weblog Title: Faster inference Feedly Summary: Two interesting examples of inference speed as a flagship feature of LLM services today. First, Cerebras announced two new monthly plans for their extremely high speed hosted model service: Cerebras Code Pro ($50/month, 1,000 messages a day) and Cerebras Code Max ($200/month, 5,000/day).…