Tag: Tags:

  • Simon Willison’s Weblog: OpenAI: Introducing our latest image generation model in the API

    Source URL: https://simonwillison.net/2025/Apr/24/openai-images-api/ Source: Simon Willison’s Weblog Title: OpenAI: Introducing our latest image generation model in the API Feedly Summary: OpenAI: Introducing our latest image generation model in the API The astonishing native image generation capability of GPT-4o – a feature which continues to not have an obvious name – is now available via OpenAI’s…

  • Simon Willison’s Weblog: Exploring Promptfoo via Dave Guarino’s SNAP evals

    Source URL: https://simonwillison.net/2025/Apr/24/exploring-promptfoo/#atom-everything Source: Simon Willison’s Weblog Title: Exploring Promptfoo via Dave Guarino’s SNAP evals Feedly Summary: I used part three (here’s parts one and two) of Dave Guarino’s series on evaluating how well LLMs can answer questions about SNAP (aka food stamps) as an excuse to explore Promptfoo, an LLM eval tool. SNAP (Supplemental…

  • Simon Willison’s Weblog: llm-fragment-symbex

    Source URL: https://simonwillison.net/2025/Apr/23/llm-fragment-symbex/#atom-everything Source: Simon Willison’s Weblog Title: llm-fragment-symbex Feedly Summary: llm-fragment-symbex I released a new LLM fragment loader plugin that builds on top of my Symbex project. Symbex is a CLI tool I wrote that can run against a folder full of Python code and output functions, classes, methods or just their docstrings and…

  • Simon Willison’s Weblog: Quoting Ellie Huxtable

    Source URL: https://simonwillison.net/2025/Apr/22/ellie-huxtable/ Source: Simon Willison’s Weblog Title: Quoting Ellie Huxtable Feedly Summary: I was against using AI for programming for a LONG time. It never felt effective. But with the latest models + tools, it finally feels like a real performance boost If you’re still holding out, do yourself a favor: spend a few…

  • Simon Willison’s Weblog: OpenAI o3 and o4-mini System Card

    Source URL: https://simonwillison.net/2025/Apr/21/openai-o3-and-o4-mini-system-card/ Source: Simon Willison’s Weblog Title: OpenAI o3 and o4-mini System Card Feedly Summary: OpenAI o3 and o4-mini System Card I’m surprised to see a combined System Card for o3 and o4-mini in the same document – I’d expect to see these covered separately. The opening paragraph calls out the most interesting new…

  • Simon Willison’s Weblog: AI assisted search-based research actually works now

    Source URL: https://simonwillison.net/2025/Apr/21/ai-assisted-search/#atom-everything Source: Simon Willison’s Weblog Title: AI assisted search-based research actually works now Feedly Summary: For the past two and a half years the feature I’ve most wanted from LLMs is the ability to take on search-based research tasks on my behalf. We saw the first glimpses of this back in early 2023,…

  • Simon Willison’s Weblog: Quoting Ethan Mollick

    Source URL: https://simonwillison.net/2025/Apr/20/ethan-mollick/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Ethan Mollick Feedly Summary: In some tasks, AI is unreliable. In others, it is superhuman. You could, of course, say the same thing about calculators, but it is also clear that AI is different. It is already demonstrating general capabilities and performing a wide range of…

  • Simon Willison’s Weblog: Note on 20th April 2025

    Source URL: https://simonwillison.net/2025/Apr/20/janky-license/#atom-everything Source: Simon Willison’s Weblog Title: Note on 20th April 2025 Feedly Summary: Now that Llama has very real competition in open weight models (Gemma 3, latest Mistrals, DeepSeek, Qwen) I think their janky license is becoming much more of a liability for them. It’s just limiting enough that it could be the…

  • Simon Willison’s Weblog: llm-fragments-github 0.2

    Source URL: https://simonwillison.net/2025/Apr/20/llm-fragments-github/#atom-everything Source: Simon Willison’s Weblog Title: llm-fragments-github 0.2 Feedly Summary: llm-fragments-github 0.2 I upgraded my llm-fragments-github plugin to add a new fragment type called issue. It lets you pull the entire content of a GitHub issue thread into your prompt as a concatenated Markdown file. (If you haven’t seen fragments before I introduced…