Tag: llm

  • Simon Willison’s Weblog: MCP Run Python

    Source URL: https://simonwillison.net/2025/Apr/18/mcp-run-python/ Source: Simon Willison’s Weblog Title: MCP Run Python Feedly Summary: MCP Run Python Pydantic AI’s MCP server for running LLM-generated Python code in a sandbox. They ended up using a trick I explored two years ago: using a Deno process to run Pyodide in a WebAssembly sandbox. Here’s a bit of a…

  • Simon Willison’s Weblog: Start building with Gemini 2.5 Flash

    Source URL: https://simonwillison.net/2025/Apr/17/start-building-with-gemini-25-flash/ Source: Simon Willison’s Weblog Title: Start building with Gemini 2.5 Flash Feedly Summary: Start building with Gemini 2.5 Flash Google Gemini’s latest model is Gemini 2.5 Flash, available in (paid) preview as gemini-2.5-flash-preview-04-17. Building upon the popular foundation of 2.0 Flash, this new version delivers a major upgrade in reasoning capabilities, while…

  • Simon Willison’s Weblog: Quoting Ted Sanders, OpenAI

    Source URL: https://simonwillison.net/2025/Apr/17/ted-sanders/ Source: Simon Willison’s Weblog Title: Quoting Ted Sanders, OpenAI Feedly Summary: Our hypothesis is that o4-mini is a much better model, but we’ll wait to hear feedback from developers. Evals only tell part of the story, and we wouldn’t want to prematurely deprecate a model that developers continue to find value in.…

  • Simon Willison’s Weblog: Quoting James Betker

    Source URL: https://simonwillison.net/2025/Apr/16/james-betker/#atom-everything Source: Simon Willison’s Weblog Title: Quoting James Betker Feedly Summary: I work for OpenAI. […] o4-mini is actually a considerably better vision model than o3, despite the benchmarks. Similar to how o3-mini-high was a much better coding model than o1. I would recommend using o4-mini-high over o3 for any task involving vision.…

  • Simon Willison’s Weblog: Introducing OpenAI o3 and o4-mini

    Source URL: https://simonwillison.net/2025/Apr/16/introducing-openai-o3-and-o4-mini/ Source: Simon Willison’s Weblog Title: Introducing OpenAI o3 and o4-mini Feedly Summary: Introducing OpenAI o3 and o4-mini OpenAI are really emphasizing tool use with these: For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with…

  • Simon Willison’s Weblog: openai/codex

    Source URL: https://simonwillison.net/2025/Apr/16/openai-codex/ Source: Simon Willison’s Weblog Title: openai/codex Feedly Summary: openai/codex Just released by OpenAI, a “lightweight coding agent that runs in your terminal". Looks like their version of Claude Code. Tags: ai-assisted-programming, generative-ai, ai-agents, openai, ai, llms AI Summary and Description: Yes Summary: OpenAI’s recently released lightweight coding agent, integrated into the terminal,…

  • The Register: Apple: Since you care about yOuR pRiVaCy, we’ll train our AI on made-up emails

    Source URL: https://www.theregister.com/2025/04/16/apple_ai_training_privacy/ Source: The Register Title: Apple: Since you care about yOuR pRiVaCy, we’ll train our AI on made-up emails Feedly Summary: It’s LLMs all the way down Apple, having starved its AI models of data by respecting customer privacy, plans to improve its chatbot suggestions by using made-up emails.… AI Summary and Description:…