Tag: llms
-
Simon Willison’s Weblog: Qwen/Qwen3-235B-A22B-Instruct-2507
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-235b-a22b-instruct-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen/Qwen3-235B-A22B-Instruct-2507 Feedly Summary: Qwen/Qwen3-235B-A22B-Instruct-2507 Significant new model release from Qwen, published yesterday without much fanfare. This is a follow-up to their April release of the full Qwen 3 model family, which included a Qwen3-235B-A22B model which could handle both reasoning and non-reasoning prompts (via a /no_think toggle).…
-
Simon Willison’s Weblog: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data
Source URL: https://simonwillison.net/2025/Jul/22/subliminal-learning/ Source: Simon Willison’s Weblog Title: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data Feedly Summary: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data This new alignment paper from Anthropic wins my prize for best illustrative figure so far this year: The researchers found that…
-
Simon Willison’s Weblog: Our contribution to a global environmental standard for AI
Source URL: https://simonwillison.net/2025/Jul/22/mistral-environmental-standard/ Source: Simon Willison’s Weblog Title: Our contribution to a global environmental standard for AI Feedly Summary: Our contribution to a global environmental standard for AI Mistral have released environmental impact numbers for their largest model, Mistral Large 2, in more detail than I have seen from any of the other large AI…
-
Simon Willison’s Weblog: Gemini 2.5 Flash-Lite is now stable and generally available
Source URL: https://simonwillison.net/2025/Jul/22/gemini-25-flash-lite/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5 Flash-Lite is now stable and generally available Feedly Summary: Gemini 2.5 Flash-Lite is now stable and generally available The last remaining member of the Gemini 2.5 trio joins Pro and Flash in General Availability today. Gemini 2.5 Flash-Lite is the cheapest of the 2.5 family,…
-
Docker: Docker Unveils the Future of Agentic Apps at WeAreDevelopers
Source URL: https://www.docker.com/blog/wearedevelopers-docker-unveils-the-future-of-agentic-apps/ Source: Docker Title: Docker Unveils the Future of Agentic Apps at WeAreDevelopers Feedly Summary: Agentic applications – what actually are they and how do we make them easier to build, test, and deploy? At WeAreDevelopers, we defined agentic apps as those that use LLMs to define execution workflows based on desired goals…
-
Simon Willison’s Weblog: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad
Source URL: https://simonwillison.net/2025/Jul/21/gemini-imo/#atom-everything Source: Simon Willison’s Weblog Title: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad Feedly Summary: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad OpenAI beat them to the punch in terms of publicity by publishing their…
-
Simon Willison’s Weblog: Coding with LLMs in the summer of 2025 (an update)
Source URL: https://simonwillison.net/2025/Jul/21/coding-with-llms/#atom-everything Source: Simon Willison’s Weblog Title: Coding with LLMs in the summer of 2025 (an update) Feedly Summary: Coding with LLMs in the summer of 2025 (an update) Salvatore Sanfilippo describes his current AI-assisted development workflow. He’s all-in on LLMs for code review, exploratory prototyping, pair-design and writing “part of the code under…
-
Simon Willison’s Weblog: OpenAI’s gold medal performance on the International Math Olympiad
Source URL: https://simonwillison.net/2025/Jul/19/openai-gold-medal-math-olympiad/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI’s gold medal performance on the International Math Olympiad Feedly Summary: OpenAI’s gold medal performance on the International Math Olympiad OpenAI research scientist Alexander Wei: I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance…
-
Simon Willison’s Weblog: Quoting Steve Yegge
Source URL: https://simonwillison.net/2025/Jul/19/steve-yegge/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Steve Yegge Feedly Summary: So one of my favorite things to do is give my coding agents more and more permissions and freedom, just to see how far I can push their productivity without going too far off the rails. It’s a delicate balance. I haven’t…
-
Simon Willison’s Weblog: How to run an LLM on your laptop
Source URL: https://simonwillison.net/2025/Jul/18/how-to-run-an-llm-on-your-laptop/ Source: Simon Willison’s Weblog Title: How to run an LLM on your laptop Feedly Summary: How to run an LLM on your laptop I talked to Grace Huckins for this piece from MIT Technology Review on running local models. Apparently she enjoyed my dystopian backup plan! Simon Willison has a plan for…