reasoning model – Page 2 – Experimental News Clipping Site

Simon Willison’s Weblog: GPT-5: Key characteristics, pricing and model card

Aug 7, 2025

—

by

Source URL: https://simonwillison.net/2025/Aug/7/gpt-5/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5: Key characteristics, pricing and model card Feedly Summary: I’ve had preview access to the new GPT-5 model family for the past two weeks, and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM – it’s not a dramatic departure…

Simon Willison’s Weblog: OpenAI’s new open weight (Apache 2) models are really good

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/5/gpt-oss/ Source: Simon Willison’s Weblog Title: OpenAI’s new open weight (Apache 2) models are really good Feedly Summary: The long promised OpenAI open weight models are here, and they are very impressive. They’re available under proper open source licenses – Apache 2.0 – and come in two sizes, 120B and 20B. OpenAI’s own…

OpenAI : gpt-oss-120b & gpt-oss-20b Model Card

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/gpt-oss-model-card Source: OpenAI Title: gpt-oss-120b & gpt-oss-20b Model Card Feedly Summary: We introduce gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models available under the Apache 2.0 license and our gpt-oss usage policy. AI Summary and Description: Yes Summary: The introduction of gpt-oss-120b and gpt-oss-20b highlights the development of open-weight reasoning models, which reflects significant…

Simon Willison’s Weblog: More model releases on 31st July

Jul 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/31/more-models/ Source: Simon Willison’s Weblog Title: More model releases on 31st July Feedly Summary: Here are a few more model releases from today, to round out a very busy July: Cohere released Command A Vision, their first multi-modal (image input) LLM. Like their others it’s open weights under Creative Commons Attribution Non-Commercial, so…

Simon Willison’s Weblog: Qwen3-30B-A3B-Thinking-2507

Jul 30, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/30/qwen3-30b-a3b-thinking-2507/ Source: Simon Willison’s Weblog Title: Qwen3-30B-A3B-Thinking-2507 Feedly Summary: Qwen3-30B-A3B-Thinking-2507 Yesterday was Qwen3-30B-A3B-Instruct-2507. Qwen are clearly committed to their new split between reasoning and non-reasoning models (a reversal from Qwen 3 in April), because today they released the new reasoning partner to yesterday’s model: Qwen3-30B-A3B-Thinking-2507. I’m surprised at how poorly this reasoning mode…

Simon Willison’s Weblog: Qwen/Qwen3-30B-A3B-Instruct-2507

Jul 29, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/29/qwen3-30b-a3b-instruct-2507/ Source: Simon Willison’s Weblog Title: Qwen/Qwen3-30B-A3B-Instruct-2507 Feedly Summary: Qwen/Qwen3-30B-A3B-Instruct-2507 New model update from Qwen, improving on their previous Qwen3-30B-A3B release from late April. In their tweet they said: Smarter, faster, and local deployment-friendly. ✨ Key Enhancements: ✅ Enhanced reasoning, coding, and math skills ✅ Broader multilingual knowledge ✅ Improved long-context understanding (up…

Simon Willison’s Weblog: GLM-4.5: Reasoning, Coding, and Agentic Abililties

Jul 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/28/glm-45/#atom-everything Source: Simon Willison’s Weblog Title: GLM-4.5: Reasoning, Coding, and Agentic Abililties Feedly Summary: GLM-4.5: Reasoning, Coding, and Agentic Abililties Another day, another significant new open weight model release from a Chinese frontier AI lab. This time it’s Z.ai – who rebranded (at least in English) from Zhipu AI a few months ago.…

Slashdot: Huawei Shows Off 384-Chip AI Computing System That Rival Nvidia’s Top Product

Jul 27, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://hardware.slashdot.org/story/25/07/27/2248257/huawei-shows-off-384-chip-ai-computing-system-that-rival-nvidias-top-product Source: Slashdot Title: Huawei Shows Off 384-Chip AI Computing System That Rival Nvidia’s Top Product Feedly Summary: AI Summary and Description: Yes Summary: Huawei’s CloudMatrix 384 AI computing system, showcased at the World Artificial Intelligence Conference, offers significant performance metrics that rival Nvidia’s offerings despite export restrictions. Additionally, Alibaba introduced a new…

Slashdot: Meta Names Shengjia Zhao As Chief Scientist of AI Superintelligence Unit

Jul 26, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://tech.slashdot.org/story/25/07/25/2143237/meta-names-shengjia-zhao-as-chief-scientist-of-ai-superintelligence-unit Source: Slashdot Title: Meta Names Shengjia Zhao As Chief Scientist of AI Superintelligence Unit Feedly Summary: AI Summary and Description: Yes Summary: Meta has appointed Shengjia Zhao as Chief Scientist of its new Meta Superintelligence Labs (MSL), recognizing his expertise in AI reasoning models. This move is significant for the advancement of…

Simon Willison’s Weblog: Qwen3-235B-A22B-Thinking-2507

Jul 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/25/qwen3-235b-a22b-thinking-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-235B-A22B-Thinking-2507 Feedly Summary: Qwen3-235B-A22B-Thinking-2507 The third Qwen model release week, following Qwen3-235B-A22B-Instruct-2507 on Monday 21st and Qwen3-Coder-480B-A35B-Instruct on Tuesday 22nd. Those two were both non-reasoning models – a change from the previous models in the Qwen 3 family which combined reasoning and non-reasoning in the same model,…

Tag: reasoning model