Tag: Tags:

  • Simon Willison’s Weblog: Daniel Stenberg’s note on AI assisted curl bug reports

    Source URL: https://simonwillison.net/2025/Oct/2/curl/ Source: Simon Willison’s Weblog Title: Daniel Stenberg’s note on AI assisted curl bug reports Feedly Summary: Daniel Stenberg’s note on AI assisted curl bug reports Curl maintainer Daniel Stenberg on Mastodon: Joshua Rogers sent us a massive list of potential issues in #curl that he found using his set of AI assisted…

  • Simon Willison’s Weblog: Two more Chinese pelicans

    Source URL: https://simonwillison.net/2025/Oct/1/two-pelicans/#atom-everything Source: Simon Willison’s Weblog Title: Two more Chinese pelicans Feedly Summary: Two new models from Chinese AI labs in the past few days. I tried them both out using llm-openrouter: DeepSeek-V3.2-Exp from DeepSeek. Announcement, Tech Report, Hugging Face (690GB, MIT license). As an intermediate step toward our next-generation architecture, V3.2-Exp builds upon…

  • Simon Willison’s Weblog: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now)

    Source URL: https://simonwillison.net/2025/Sep/29/claude-sonnet-4-5/ Source: Simon Willison’s Weblog Title: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now) Feedly Summary: Anthropic released Claude Sonnet 4.5 today, with a very bold set of claims: Claude Sonnet 4.5 is the best coding model in the world. It’s the strongest model for…

  • Simon Willison’s Weblog: Armin Ronacher: 90%

    Source URL: https://simonwillison.net/2025/Sep/29/armin-ronacher-90/ Source: Simon Willison’s Weblog Title: Armin Ronacher: 90% Feedly Summary: Armin Ronacher: 90% The idea of AI writing “90% of the code" to-date has mostly been expressed by people who sell AI tooling. Over the last few months, I’ve increasingly seen the same idea come coming much more credible sources. Armin is…

  • Simon Willison’s Weblog: Quoting Scott Aaronson

    Source URL: https://simonwillison.net/2025/Sep/29/scott-aaronson/ Source: Simon Willison’s Weblog Title: Quoting Scott Aaronson Feedly Summary: Given a week or two to try out ideas and search the literature, I’m pretty sure that Freek and I could’ve solved this problem ourselves. Instead, though, I simply asked GPT5-Thinking. After five minutes, it gave me something confident, plausible-looking, and (I…

  • Simon Willison’s Weblog: Quoting Nick Turley

    Source URL: https://simonwillison.net/2025/Sep/28/nick-turley/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Nick Turley Feedly Summary: We’ve seen the strong reactions to 4o responses and want to explain what is happening. We’ve started testing a new safety routing system in ChatGPT. As we previously mentioned, when conversations touch on sensitive and emotional topics the system may switch mid-chat…

  • Simon Willison’s Weblog: Video models are zero-shot learners and reasoners

    Source URL: https://simonwillison.net/2025/Sep/27/video-models-are-zero-shot-learners-and-reasoners/ Source: Simon Willison’s Weblog Title: Video models are zero-shot learners and reasoners Feedly Summary: Video models are zero-shot learners and reasoners Fascinating new paper from Google DeepMind which makes a very convincing case that their Veo 3 model – and generative video models in general – serve a similar role in the…

  • Simon Willison’s Weblog: How to stop AI’s “lethal trifecta”

    Source URL: https://simonwillison.net/2025/Sep/26/how-to-stop-ais-lethal-trifecta/ Source: Simon Willison’s Weblog Title: How to stop AI’s “lethal trifecta” Feedly Summary: How to stop AI’s “lethal trifecta” This is the second mention of the lethal trifecta in the Economist in just the last week! Their earlier coverage was Why AI systems may never be secure on September 22nd – I…

  • Simon Willison’s Weblog: Improved Gemini 2.5 Flash and Flash-Lite

    Source URL: https://simonwillison.net/2025/Sep/25/improved-gemini-25-flash-and-flash-lite/#atom-everything Source: Simon Willison’s Weblog Title: Improved Gemini 2.5 Flash and Flash-Lite Feedly Summary: Improved Gemini 2.5 Flash and Flash-Lite Two new preview models from Google – updates to their fast and inexpensive Flash and Flash Lite families: The latest version of Gemini 2.5 Flash-Lite was trained and built based on three key…