Tag: thinking

  • Simon Willison’s Weblog: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now)

    Source URL: https://simonwillison.net/2025/Sep/29/claude-sonnet-4-5/ Source: Simon Willison’s Weblog Title: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now) Feedly Summary: Anthropic released Claude Sonnet 4.5 today, with a very bold set of claims: Claude Sonnet 4.5 is the best coding model in the world. It’s the strongest model for…

  • Simon Willison’s Weblog: Quoting Scott Aaronson

    Source URL: https://simonwillison.net/2025/Sep/29/scott-aaronson/ Source: Simon Willison’s Weblog Title: Quoting Scott Aaronson Feedly Summary: Given a week or two to try out ideas and search the literature, I’m pretty sure that Freek and I could’ve solved this problem ourselves. Instead, though, I simply asked GPT5-Thinking. After five minutes, it gave me something confident, plausible-looking, and (I…

  • Simon Willison’s Weblog: Quoting Nick Turley

    Source URL: https://simonwillison.net/2025/Sep/28/nick-turley/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Nick Turley Feedly Summary: We’ve seen the strong reactions to 4o responses and want to explain what is happening. We’ve started testing a new safety routing system in ChatGPT. As we previously mentioned, when conversations touch on sensitive and emotional topics the system may switch mid-chat…

  • Simon Willison’s Weblog: How to stop AI’s “lethal trifecta”

    Source URL: https://simonwillison.net/2025/Sep/26/how-to-stop-ais-lethal-trifecta/ Source: Simon Willison’s Weblog Title: How to stop AI’s “lethal trifecta” Feedly Summary: How to stop AI’s “lethal trifecta” This is the second mention of the lethal trifecta in the Economist in just the last week! Their earlier coverage was Why AI systems may never be secure on September 22nd – I…

  • Simon Willison’s Weblog: Improved Gemini 2.5 Flash and Flash-Lite

    Source URL: https://simonwillison.net/2025/Sep/25/improved-gemini-25-flash-and-flash-lite/#atom-everything Source: Simon Willison’s Weblog Title: Improved Gemini 2.5 Flash and Flash-Lite Feedly Summary: Improved Gemini 2.5 Flash and Flash-Lite Two new preview models from Google – updates to their fast and inexpensive Flash and Flash Lite families: The latest version of Gemini 2.5 Flash-Lite was trained and built based on three key…

  • Simon Willison’s Weblog: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action

    Source URL: https://simonwillison.net/2025/Sep/23/qwen3-vl/ Source: Simon Willison’s Weblog Title: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action Feedly Summary: Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action I’ve been looking forward to this. Qwen 2.5 VL is one of the best available open weight vision LLMs, so I had high hopes for Qwen 3’s vision models. Firstly, we…

  • Simon Willison’s Weblog: Four new releases from Qwen

    Source URL: https://simonwillison.net/2025/Sep/22/qwen/ Source: Simon Willison’s Weblog Title: Four new releases from Qwen Feedly Summary: It’s been an extremely busy day for team Qwen. Within the last 24 hours (all links to Twitter, which seems to be their preferred platform for these announcements): Qwen3-Next-80B-A3B-Instruct-FP8 and Qwen3-Next-80B-A3B-Thinking-FP8 – official FP8 quantized versions of their Qwen3-Next models.…

  • Simon Willison’s Weblog: CompileBench: Can AI Compile 22-year-old Code?

    Source URL: https://simonwillison.net/2025/Sep/22/compilebench/ Source: Simon Willison’s Weblog Title: CompileBench: Can AI Compile 22-year-old Code? Feedly Summary: CompileBench: Can AI Compile 22-year-old Code? Interesting new LLM benchmark from Piotr Grabowski and Piotr Migdał: how well can different models handle compilation challenges such as cross-compiling gucr for ARM64 architecture? This is one of my favorite applications of…

  • The Cloudflare Blog: Cloudflare’s 2025 Annual Founders’ Letter

    Source URL: https://blog.cloudflare.com/cloudflare-2025-annual-founders-letter/ Source: The Cloudflare Blog Title: Cloudflare’s 2025 Annual Founders’ Letter Feedly Summary: Cloudflare launched 15 years ago. We like to celebrate our birthday by launching new products that give back to the Internet. But we’ve also been thinking a lot about what’s changed on the Internet. AI Summary and Description: Yes Summary:…