Tag: generative

  • Simon Willison’s Weblog: Gemma 3 QAT Models

    Source URL: https://simonwillison.net/2025/Apr/19/gemma-3-qat-models/ Source: Simon Willison’s Weblog Title: Gemma 3 QAT Models Feedly Summary: Gemma 3 QAT Models Interesting release from Google, as a follow-up to Gemma 3 from last month: To make Gemma 3 even more accessible, we are announcing new versions optimized with Quantization-Aware Training (QAT) that dramatically reduces memory requirements while maintaining…

  • Slashdot: As Russia and China ‘Seed Chatbots With Lies’, Any Bad Actor Could Game AI the Same Way

    Source URL: https://yro.slashdot.org/story/25/04/19/1531238/as-russia-and-china-seed-chatbots-with-lies-any-bad-actor-could-game-ai-the-same-way?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: As Russia and China ‘Seed Chatbots With Lies’, Any Bad Actor Could Game AI the Same Way Feedly Summary: AI Summary and Description: Yes Summary: The text discusses how Russia is automating the spread of misinformation to manipulate AI chatbots, potentially serving as a model for other malicious actors.…

  • Simon Willison’s Weblog: Quoting Andrew Ng

    Source URL: https://simonwillison.net/2025/Apr/18/andrew-ng/ Source: Simon Willison’s Weblog Title: Quoting Andrew Ng Feedly Summary: To me, a successful eval meets the following criteria. Say, we currently have system A, and we might tweak it to get a system B: If A works significantly better than B according to a skilled human judge, the eval should give…

  • Simon Willison’s Weblog: Image segmentation using Gemini 2.5

    Source URL: https://simonwillison.net/2025/Apr/18/gemini-image-segmentation/ Source: Simon Willison’s Weblog Title: Image segmentation using Gemini 2.5 Feedly Summary: Max Woolf pointed out this new feature of the Gemini 2.5 series in a comment on Hacker News: One hidden note from Gemini 2.5 Flash when diving deep into the documentation: for image inputs, not only can the model be…

  • Simon Willison’s Weblog: MCP Run Python

    Source URL: https://simonwillison.net/2025/Apr/18/mcp-run-python/ Source: Simon Willison’s Weblog Title: MCP Run Python Feedly Summary: MCP Run Python Pydantic AI’s MCP server for running LLM-generated Python code in a sandbox. They ended up using a trick I explored two years ago: using a Deno process to run Pyodide in a WebAssembly sandbox. Here’s a bit of a…

  • Simon Willison’s Weblog: Quoting Ted Sanders, OpenAI

    Source URL: https://simonwillison.net/2025/Apr/17/ted-sanders/ Source: Simon Willison’s Weblog Title: Quoting Ted Sanders, OpenAI Feedly Summary: Our hypothesis is that o4-mini is a much better model, but we’ll wait to hear feedback from developers. Evals only tell part of the story, and we wouldn’t want to prematurely deprecate a model that developers continue to find value in.…

  • Simon Willison’s Weblog: Quoting James Betker

    Source URL: https://simonwillison.net/2025/Apr/16/james-betker/#atom-everything Source: Simon Willison’s Weblog Title: Quoting James Betker Feedly Summary: I work for OpenAI. […] o4-mini is actually a considerably better vision model than o3, despite the benchmarks. Similar to how o3-mini-high was a much better coding model than o1. I would recommend using o4-mini-high over o3 for any task involving vision.…

  • Simon Willison’s Weblog: Introducing OpenAI o3 and o4-mini

    Source URL: https://simonwillison.net/2025/Apr/16/introducing-openai-o3-and-o4-mini/ Source: Simon Willison’s Weblog Title: Introducing OpenAI o3 and o4-mini Feedly Summary: Introducing OpenAI o3 and o4-mini OpenAI are really emphasizing tool use with these: For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with…

  • Simon Willison’s Weblog: openai/codex

    Source URL: https://simonwillison.net/2025/Apr/16/openai-codex/ Source: Simon Willison’s Weblog Title: openai/codex Feedly Summary: openai/codex Just released by OpenAI, a “lightweight coding agent that runs in your terminal". Looks like their version of Claude Code. Tags: ai-assisted-programming, generative-ai, ai-agents, openai, ai, llms AI Summary and Description: Yes Summary: OpenAI’s recently released lightweight coding agent, integrated into the terminal,…