Tag: openrouter
-
Simon Willison’s Weblog: Qwen/Qwen3-235B-A22B-Instruct-2507
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-235b-a22b-instruct-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen/Qwen3-235B-A22B-Instruct-2507 Feedly Summary: Qwen/Qwen3-235B-A22B-Instruct-2507 Significant new model release from Qwen, published yesterday without much fanfare. This is a follow-up to their April release of the full Qwen 3 model family, which included a Qwen3-235B-A22B model which could handle both reasoning and non-reasoning prompts (via a /no_think toggle).…
-
Simon Willison’s Weblog: Grok 4
Source URL: https://simonwillison.net/2025/Jul/10/grok-4/#atom-everything Source: Simon Willison’s Weblog Title: Grok 4 Feedly Summary: Grok 4 Released last night, Grok 4 is now available via both API and a paid subscription for end-users. Key characteristics: image and text input, text output. 256,000 context length (twice that of Grok 3). It’s a reasoning model where you can’t see…
-
Simon Willison’s Weblog: Become a command-line superhero with Simon Willison’s llm tool
Source URL: https://simonwillison.net/2025/Jul/7/become-a-command-line-superhero-with-simon-willisons-llm-tool/#atom-everything Source: Simon Willison’s Weblog Title: Become a command-line superhero with Simon Willison’s llm tool Feedly Summary: Become a command-line superhero with Simon Willison’s llm tool Christopher Smith ran a mini hackathon in Albany New York at the weekend around uses of my LLM – the first in-person event I’m aware of dedicated…
-
Simon Willison’s Weblog: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM
Source URL: https://simonwillison.net/2025/May/31/snitchbench-with-llm/#atom-everything Source: Simon Willison’s Weblog Title: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM Feedly Summary: A fun new benchmark just dropped! Inspired by the Claude 4 system card – which showed that Claude 4 might just rat you out to the authorities if you told it to “take initiative" in…
-
Simon Willison’s Weblog: Understanding the recent criticism of the Chatbot Arena
Source URL: https://simonwillison.net/2025/Apr/30/criticism-of-the-chatbot-arena/#atom-everything Source: Simon Willison’s Weblog Title: Understanding the recent criticism of the Chatbot Arena Feedly Summary: The Chatbot Arena has become the go-to place for vibes-based evaluation of LLMs over the past two years. The project, originating at UC Berkeley, is home to a large community of model enthusiasts who submit prompts to…
-
Simon Willison’s Weblog: Maybe Meta’s Llama claims to be open source because of the EU AI act
Source URL: https://simonwillison.net/2025/Apr/19/llama-eu-ai-act/#atom-everything Source: Simon Willison’s Weblog Title: Maybe Meta’s Llama claims to be open source because of the EU AI act Feedly Summary: I encountered a theory a while ago that one of the reasons Meta insist on using the term “open source” for their Llama models despite the Llama license not actually conforming…
-
Simon Willison’s Weblog: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet
Source URL: https://simonwillison.net/2025/Apr/14/gpt-4-1/ Source: Simon Willison’s Weblog Title: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Feedly Summary: OpenAI introduced three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out…
-
Simon Willison’s Weblog: Note on 5th April 2025
Source URL: https://simonwillison.net/2025/Apr/5/llama-4-notes/#atom-everything Source: Simon Willison’s Weblog Title: Note on 5th April 2025 Feedly Summary: Dropping a model release as significant as Llama 4 on a weekend is plain unfair! So far the best place to learn about the new model family is this post on the Meta AI blog. You can try them out…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-0324
Source URL: https://simonwillison.net/2025/Mar/24/deepseek/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-0324 Feedly Summary: deepseek-ai/DeepSeek-V3-0324 Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT, the README is empty and the release adds up a to a total of 641 GB…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-0324
Source URL: https://simonwillison.net/2025/Mar/24/deepseek/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-0324 Feedly Summary: deepseek-ai/DeepSeek-V3-0324 Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT, the README is empty and the release adds up a to a total of 641 GB…