Tag: Tags:
-
Simon Willison’s Weblog: too many model context protocol servers and LLM allocations on the dance floor
Source URL: https://simonwillison.net/2025/Aug/22/too-many-mcps/#atom-everything Source: Simon Willison’s Weblog Title: too many model context protocol servers and LLM allocations on the dance floor Feedly Summary: too many model context protocol servers and LLM allocations on the dance floor Useful reminder from Geoffrey Huntley of the infrequently discussed significant token cost of using MCP. Geoffrey estimate estimates that…
-
Simon Willison’s Weblog: Quoting Mustafa Suleyman
Source URL: https://simonwillison.net/2025/Aug/21/mustafa-suleyman/ Source: Simon Willison’s Weblog Title: Quoting Mustafa Suleyman Feedly Summary: Simply put, my central worry is that many people will start to believe in the illusion of AIs as conscious entities so strongly that they’ll soon advocate for AI rights, model welfare and even AI citizenship. This development will be a dangerous…
-
Simon Willison’s Weblog: llama.cpp guide: running gpt-oss with llama.cpp
Source URL: https://simonwillison.net/2025/Aug/19/gpt-oss-with-llama-cpp/ Source: Simon Willison’s Weblog Title: llama.cpp guide: running gpt-oss with llama.cpp Feedly Summary: llama.cpp guide: running gpt-oss with llama.cpp Really useful official guide to running the OpenAI gpt-oss models using llama-server from llama.cpp – which provides an OpenAI-compatible localhost API and a neat web interface for interacting with the models. TLDR version…
-
Simon Willison’s Weblog: Google Gemini URL Context
Source URL: https://simonwillison.net/2025/Aug/18/google-gemini-url-context/ Source: Simon Willison’s Weblog Title: Google Gemini URL Context Feedly Summary: Google Gemini URL Context New feature in the Gemini API: you can now enable a url_context tool which the models can use to request the contents of URLs as part of replying to a prompt. I released llm-gemini 0.25 with a…
-
Simon Willison’s Weblog: TIL: Running a gpt-oss eval suite against LM Studio on a Mac
Source URL: https://simonwillison.net/2025/Aug/17/gpt-oss-eval-suite/#atom-everything Source: Simon Willison’s Weblog Title: TIL: Running a gpt-oss eval suite against LM Studio on a Mac Feedly Summary: TIL: Running a gpt-oss eval suite against LM Studio on a Mac The other day I learned that OpenAI published a set of evals as part of their gpt-oss model release, described in…
-
Simon Willison’s Weblog: GPT-5 has a hidden system prompt
Source URL: https://simonwillison.net/2025/Aug/15/gpt-5-has-a-hidden-system-prompt/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5 has a hidden system prompt Feedly Summary: GPT-5 has a hidden system prompt It looks like GPT-5 when accessed via the OpenAI API may have its own hidden system prompt, independent from the system prompt you can specify in an API call. At the very least…
-
Simon Willison’s Weblog: The Summer of Johann: prompt injections as far as the eye can see
Source URL: https://simonwillison.net/2025/Aug/15/the-summer-of-johann/#atom-everything Source: Simon Willison’s Weblog Title: The Summer of Johann: prompt injections as far as the eye can see Feedly Summary: Independent AI researcher Johann Rehberger has had an absurdly busy August. Under the heading The Month of AI Bugs he has been publishing one report per day across an array of different…
-
Simon Willison’s Weblog: Meta’s AI rules have let bots hold ‘sensual’ chats with kids, offer false medical info
Source URL: https://simonwillison.net/2025/Aug/15/metas-ai-rules/ Source: Simon Willison’s Weblog Title: Meta’s AI rules have let bots hold ‘sensual’ chats with kids, offer false medical info Feedly Summary: Meta’s AI rules have let bots hold ‘sensual’ chats with kids, offer false medical info This is grim. Reuters got hold of a leaked copy Meta’s internal “GenAI: Content Risk…
-
Simon Willison’s Weblog: Open weight LLMs exhibit inconsistent performance across providers
Source URL: https://simonwillison.net/2025/Aug/15/inconsistent-performance/ Source: Simon Willison’s Weblog Title: Open weight LLMs exhibit inconsistent performance across providers Feedly Summary: Artificial Analysis published a new benchmark the other day, this time focusing on how an individual model – OpenAI’s gpt-oss-120b – performs across different hosted providers. The results showed some surprising differences. Here’s the one with the…
-
Simon Willison’s Weblog: Introducing Gemma 3 270M: The compact model for hyper-efficient AI
Source URL: https://simonwillison.net/2025/Aug/14/gemma-3-270m/#atom-everything Source: Simon Willison’s Weblog Title: Introducing Gemma 3 270M: The compact model for hyper-efficient AI Feedly Summary: Introducing Gemma 3 270M: The compact model for hyper-efficient AI New from Google: Gemma 3 270M, a compact, 270-million parameter model designed from the ground up for task-specific fine-tuning with strong instruction-following and text structuring…