language models – Page 12 – Experimental News Clipping Site

Simon Willison’s Weblog: OpenAI’s new open weight (Apache 2) models are really good

Aug 5, 2025

—

by

Source URL: https://simonwillison.net/2025/Aug/5/gpt-oss/ Source: Simon Willison’s Weblog Title: OpenAI’s new open weight (Apache 2) models are really good Feedly Summary: The long promised OpenAI open weight models are here, and they are very impressive. They’re available under proper open source licenses – Apache 2.0 – and come in two sizes, 120B and 20B. OpenAI’s own…

Slashdot: OpenAI Releases First Open-Weight Models Since GPT-2

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/08/05/1848236/openai-releases-first-open-weight-models-since-gpt-2?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI Releases First Open-Weight Models Since GPT-2 Feedly Summary: AI Summary and Description: Yes Summary: OpenAI’s release of two open-weight language models, gpt-oss-120b and gpt-oss-20b, marks a significant development in the AI landscape since 2019. These models enable local deployment on consumer devices and introduce advanced capabilities such as…

OpenAI : Introducing gpt-oss

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/introducing-gpt-oss Source: OpenAI Title: Introducing gpt-oss Feedly Summary: We’re releasing gpt-oss-120b and gpt-oss-20b—two state-of-the-art open-weight language models that deliver strong real-world performance at low cost. Available under the flexible Apache 2.0 license, these models outperform similarly sized open models on reasoning tasks, demonstrate strong tool use capabilities, and are optimized for efficient deployment…

Simon Willison’s Weblog: Usage charts for my LLM tool against OpenRouter

Aug 4, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/4/llm-openrouter-usage/#atom-everything Source: Simon Willison’s Weblog Title: Usage charts for my LLM tool against OpenRouter Feedly Summary: Usage charts for my LLM tool against OpenRouter OpenRouter proxies requests to a large number of different LLMs and provides high level statistics of which models are the most popular among their users. Tools that call OpenRouter…

Tomasz Tunguz: Small Action Models Are the Future of AI Agents

Aug 4, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.tomtunguz.com/ai-skills-inversion/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, and the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

Tomasz Tunguz: Small Action Models Are the Future of AI Agents

Aug 4, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.tomtunguz.com/local-instructions/ Source: Tomasz Tunguz Title: Small Action Models Are the Future of AI Agents Feedly Summary: 2025 is the year of agents, & the key capability of agents is calling tools. When using Claude Code, I can tell the AI to sift through a newsletter, find all the links to startups, verify they…

Simon Willison’s Weblog: Deep Think in the Gemini app

Aug 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/1/deep-think-in-the-gemini-app/ Source: Simon Willison’s Weblog Title: Deep Think in the Gemini app Feedly Summary: Deep Think in the Gemini app Google released Gemini 2.5 Deep Think this morning, exclusively to their Ultra ($250/month) subscribers: It is a variation of the model that recently achieved the gold-medal standard at this year’s International Mathematical Olympiad…

The Register: AI is contributing to Meta’s growth – just not the kind anyone cares about

Aug 1, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/08/01/meta_ai_investments/ Source: The Register Title: AI is contributing to Meta’s growth – just not the kind anyone cares about Feedly Summary: Good old machine learning, not LLMs, are what’s really paying for Zuck’s genAI splurge Believe it or not, Meta’s AI investments made a meaningful difference to its advertising business in Q2 —…

Simon Willison’s Weblog: Reverse engineering some updates to Claude

Jul 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/31/updates-to-claude/#atom-everything Source: Simon Willison’s Weblog Title: Reverse engineering some updates to Claude Feedly Summary: Anthropic released two major new features for their consumer-facing Claude apps in the past couple of days. Sadly, they don’t do a very good job of updating the release notes for those apps – neither of these releases came…

Simon Willison’s Weblog: More model releases on 31st July

Jul 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/31/more-models/ Source: Simon Willison’s Weblog Title: More model releases on 31st July Feedly Summary: Here are a few more model releases from today, to round out a very busy July: Cohere released Command A Vision, their first multi-modal (image input) LLM. Like their others it’s open weights under Creative Commons Attribution Non-Commercial, so…

Tag: language models