Tag: Tags:
-
Simon Willison’s Weblog: Defeating Nondeterminism in LLM Inference
Source URL: https://simonwillison.net/2025/Sep/11/defeating-nondeterminism/#atom-everything Source: Simon Willison’s Weblog Title: Defeating Nondeterminism in LLM Inference Feedly Summary: Defeating Nondeterminism in LLM Inference A very common question I see about LLMs concerns why they can’t be made to deliver the same response to the same prompt by setting a fixed random number seed. Like many others I had…
-
Simon Willison’s Weblog: Claude API: Web fetch tool
Source URL: https://simonwillison.net/2025/Sep/10/claude-web-fetch-tool/#atom-everything Source: Simon Willison’s Weblog Title: Claude API: Web fetch tool Feedly Summary: Claude API: Web fetch tool New in the Claude API: if you pass the web-fetch-2025-09-10 beta header you can add {“type": "web_fetch_20250910", "name": "web_fetch", "max_uses": 5} to your "tools" list and Claude will gain the ability to fetch content from…
-
Simon Willison’s Weblog: My review of Claude’s new Code Interpreter, released under a very confusing name
Source URL: https://simonwillison.net/2025/Sep/9/claude-code-interpreter/ Source: Simon Willison’s Weblog Title: My review of Claude’s new Code Interpreter, released under a very confusing name Feedly Summary: Today on the Anthropic blog: Claude can now create and edit files: Claude can now create and edit Excel spreadsheets, documents, PowerPoint slide decks, and PDFs directly in Claude.ai and the desktop…
-
Simon Willison’s Weblog: Anthropic status: Model output quality
Source URL: https://simonwillison.net/2025/Sep/9/anthropic-model-output-quality/ Source: Simon Willison’s Weblog Title: Anthropic status: Model output quality Feedly Summary: Anthropic status: Model output quality Anthropic previously reported model serving bugs that affected Claude Opus 4 and 4.1 for 56.5 hours. They’ve now fixed additional bugs affecting “a small percentage" of Sonnet 4 requests for almost a month, plus a…
-
Simon Willison’s Weblog: Load Llama-3.2 WebGPU in your browser from a local folder
Source URL: https://simonwillison.net/2025/Sep/8/webgpu-local-folder/#atom-everything Source: Simon Willison’s Weblog Title: Load Llama-3.2 WebGPU in your browser from a local folder Feedly Summary: Load Llama-3.2 WebGPU in your browser from a local folder Inspired by a comment on Hacker News I decided to see if it was possible to modify the transformers.js-examples/tree/main/llama-3.2-webgpu Llama 3.2 chat demo (online here,…
-
Simon Willison’s Weblog: Quoting James Luan
Source URL: https://simonwillison.net/2025/Sep/8/james-luan/ Source: Simon Willison’s Weblog Title: Quoting James Luan Feedly Summary: I recently spoke with the CTO of a popular AI note-taking app who told me something surprising: they spend twice as much on vector search as they do on OpenAI API calls. Think about that for a second. Running the retrieval layer…
-
Simon Willison’s Weblog: Is the LLM response wrong, or have you just failed to iterate it?
Source URL: https://simonwillison.net/2025/Sep/7/is-the-llm-response-wrong-or-have-you-just-failed-to-iterate-it/#atom-everything Source: Simon Willison’s Weblog Title: Is the LLM response wrong, or have you just failed to iterate it? Feedly Summary: Is the LLM response wrong, or have you just failed to iterate it? More from Mike Caulfield (see also the SIFT method). He starts with a fantastic example of Google’s AI mode…
-
Simon Willison’s Weblog: Quoting Jason Liu
Source URL: https://simonwillison.net/2025/Sep/6/jason-liu/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Jason Liu Feedly Summary: I am once again shocked at how much better image retrieval performance you can get if you embed highly opinionated summaries of an image, a summary that came out of a visual language model, than using CLIP embeddings themselves. If you tell…
-
Simon Willison’s Weblog: Kimi-K2-Instruct-0905
Source URL: https://simonwillison.net/2025/Sep/6/kimi-k2-instruct-0905/#atom-everything Source: Simon Willison’s Weblog Title: Kimi-K2-Instruct-0905 Feedly Summary: Kimi-K2-Instruct-0905 New not-quite-MIT licensed model from Chinese Moonshot AI, a follow-up to the highly regarded Kimi-K2 model they released in July. This one is an incremental improvement – I’ve seen it referred to online as “Kimi K-2.1". It scores a little higher on a…
-
Simon Willison’s Weblog: Why I think the $1.5 billion Anthropic class action settlement may count as a win for Anthropic
Source URL: https://simonwillison.net/2025/Sep/6/anthropic-settlement/#atom-everything Source: Simon Willison’s Weblog Title: Why I think the $1.5 billion Anthropic class action settlement may count as a win for Anthropic Feedly Summary: Anthropic to pay $1.5 billion to authors in landmark AI settlement I wrote about the details of this case when it was found that Anthropic’s training on book…