reasoning model – Experimental News Clipping Site

Simon Willison’s Weblog: GPT-5 pro

Oct 6, 2025

—

by

Source URL: https://simonwillison.net/2025/Oct/6/gpt-5-pro/ Source: Simon Willison’s Weblog Title: GPT-5 pro Feedly Summary: GPT-5 pro Here’s OpenAI’s model documentation for their GPT-5 pro model, released to their API today at their DevDay event. It has similar base characteristics to GPT-5: both share a September 30, 2024 knowledge cutoff and 400,000 context limit. GPT-5 pro has maximum…

Simon Willison’s Weblog: Quoting Nick Turley

Sep 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/28/nick-turley/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Nick Turley Feedly Summary: We’ve seen the strong reactions to 4o responses and want to explain what is happening. We’ve started testing a new safety routing system in ChatGPT. As we previously mentioned, when conversations touch on sensitive and emotional topics the system may switch mid-chat…

Simon Willison’s Weblog: Grok 4 Fast

Sep 21, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/20/grok-4-fast/ Source: Simon Willison’s Weblog Title: Grok 4 Fast Feedly Summary: Grok 4 Fast New hosted reasoning model from xAI that’s designed to be fast and extremely competitive on price. It has a 2 million token context window and “was trained end-to-end with tool-use reinforcement learning". It’s priced at $0.20/million input tokens and…

Simon Willison’s Weblog: Magistral 1.2

Sep 19, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/19/magistral/ Source: Simon Willison’s Weblog Title: Magistral 1.2 Feedly Summary: Mistral quietly released two new models yesterday: Magistral Small 1.2 (Apache 2.0, 96.1 GB on Hugging Face) and Magistral Medium 1.2 (not open weights same as Mistral’s other “medium" models.) Despite being described as "minor updates" to the Magistral 1.1 models these have…

Simon Willison’s Weblog: ICPC medals for OpenAI and Gemini

Sep 17, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/17/icpc/#atom-everything Source: Simon Willison’s Weblog Title: ICPC medals for OpenAI and Gemini Feedly Summary: In July it was the International Math Olympiad (OpenAI, Gemini), today it’s the International Collegiate Programming Contest (ICPC). Once again, both OpenAI and Gemini competed with models that achieved Gold medal performance. OpenAI’s Mostafa Rohaninejad: We received the problems…

Cloud Blog: How Baseten achieves 225% better cost-performance for AI inference (and you can too)

Sep 4, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/how-baseten-achieves-better-cost-performance-for-ai-inference/ Source: Cloud Blog Title: How Baseten achieves 225% better cost-performance for AI inference (and you can too) Feedly Summary: Baseten is one of a growing number of AI infrastructure providers, helping other startups run their models and experiments at speed and scale. Given the importance of those two factors to its customers,…

OpenAI : Building more helpful ChatGPT experiences for everyone

Sep 2, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://openai.com/index/building-more-helpful-chatgpt-experiences-for-everyone Source: OpenAI Title: Building more helpful ChatGPT experiences for everyone Feedly Summary: We’re partnering with experts, strengthening protections for teens with parental controls, and routing sensitive conversations to reasoning models in ChatGPT. AI Summary and Description: Yes Summary: The text outlines a commitment to enhancing teen safety through parental controls and the…

Simon Willison’s Weblog: DeepSeek 3.1

Aug 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/22/deepseek-31/#atom-everything Source: Simon Willison’s Weblog Title: DeepSeek 3.1 Feedly Summary: DeepSeek 3.1 The latest model from DeepSeek, a 685B monster (like DeepSeek v3 before it) but this time it’s a hybrid reasoning model. DeepSeek claim: DeepSeek-V3.1-Think achieves comparable answer quality to DeepSeek-R1-0528, while responding more quickly. Drew Breunig points out that their benchmarks…

Slashdot: LLMs’ ‘Simulated Reasoning’ Abilities Are a ‘Brittle Mirage,’ Researchers Find

Aug 12, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/08/11/2253229/llms-simulated-reasoning-abilities-are-a-brittle-mirage-researchers-find?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: LLMs’ ‘Simulated Reasoning’ Abilities Are a ‘Brittle Mirage,’ Researchers Find Feedly Summary: AI Summary and Description: Yes Summary: Recent investigations into chain-of-thought reasoning models in AI reveal limitations in their logical reasoning capabilities, suggesting they operate more as pattern-matchers than true reasoners. The findings raise crucial concerns for industries…

Simon Willison’s Weblog: Quoting Sam Altman

Aug 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/10/sam-altman/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Sam Altman Feedly Summary: the percentage of users using reasoning models each day is significantly increasing; for example, for free users we went from

Tag: reasoning model