Tag: generative

  • Simon Willison’s Weblog: Maybe Meta’s Llama claims to be open source because of the EU AI act

    Source URL: https://simonwillison.net/2025/Apr/19/llama-eu-ai-act/#atom-everything Source: Simon Willison’s Weblog Title: Maybe Meta’s Llama claims to be open source because of the EU AI act Feedly Summary: I encountered a theory a while ago that one of the reasons Meta insist on using the term “open source” for their Llama models despite the Llama license not actually conforming…

  • Simon Willison’s Weblog: Claude Code: Best practices for agentic coding

    Source URL: https://simonwillison.net/2025/Apr/19/claude-code-best-practices/#atom-everything Source: Simon Willison’s Weblog Title: Claude Code: Best practices for agentic coding Feedly Summary: Claude Code: Best practices for agentic coding Extensive new documentation from Anthropic on how to get the best results out of their Claude Code CLI coding agent tool, which includes this fascinating tip: We recommend using the word…

  • Simon Willison’s Weblog: Gemma 3 QAT Models

    Source URL: https://simonwillison.net/2025/Apr/19/gemma-3-qat-models/ Source: Simon Willison’s Weblog Title: Gemma 3 QAT Models Feedly Summary: Gemma 3 QAT Models Interesting release from Google, as a follow-up to Gemma 3 from last month: To make Gemma 3 even more accessible, we are announcing new versions optimized with Quantization-Aware Training (QAT) that dramatically reduces memory requirements while maintaining…

  • Slashdot: As Russia and China ‘Seed Chatbots With Lies’, Any Bad Actor Could Game AI the Same Way

    Source URL: https://yro.slashdot.org/story/25/04/19/1531238/as-russia-and-china-seed-chatbots-with-lies-any-bad-actor-could-game-ai-the-same-way?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: As Russia and China ‘Seed Chatbots With Lies’, Any Bad Actor Could Game AI the Same Way Feedly Summary: AI Summary and Description: Yes Summary: The text discusses how Russia is automating the spread of misinformation to manipulate AI chatbots, potentially serving as a model for other malicious actors.…

  • Simon Willison’s Weblog: Quoting Andrew Ng

    Source URL: https://simonwillison.net/2025/Apr/18/andrew-ng/ Source: Simon Willison’s Weblog Title: Quoting Andrew Ng Feedly Summary: To me, a successful eval meets the following criteria. Say, we currently have system A, and we might tweak it to get a system B: If A works significantly better than B according to a skilled human judge, the eval should give…

  • Simon Willison’s Weblog: Image segmentation using Gemini 2.5

    Source URL: https://simonwillison.net/2025/Apr/18/gemini-image-segmentation/ Source: Simon Willison’s Weblog Title: Image segmentation using Gemini 2.5 Feedly Summary: Max Woolf pointed out this new feature of the Gemini 2.5 series in a comment on Hacker News: One hidden note from Gemini 2.5 Flash when diving deep into the documentation: for image inputs, not only can the model be…

  • Simon Willison’s Weblog: MCP Run Python

    Source URL: https://simonwillison.net/2025/Apr/18/mcp-run-python/ Source: Simon Willison’s Weblog Title: MCP Run Python Feedly Summary: MCP Run Python Pydantic AI’s MCP server for running LLM-generated Python code in a sandbox. They ended up using a trick I explored two years ago: using a Deno process to run Pyodide in a WebAssembly sandbox. Here’s a bit of a…

  • Simon Willison’s Weblog: Quoting Ted Sanders, OpenAI

    Source URL: https://simonwillison.net/2025/Apr/17/ted-sanders/ Source: Simon Willison’s Weblog Title: Quoting Ted Sanders, OpenAI Feedly Summary: Our hypothesis is that o4-mini is a much better model, but we’ll wait to hear feedback from developers. Evals only tell part of the story, and we wouldn’t want to prematurely deprecate a model that developers continue to find value in.…

  • Simon Willison’s Weblog: Quoting James Betker

    Source URL: https://simonwillison.net/2025/Apr/16/james-betker/#atom-everything Source: Simon Willison’s Weblog Title: Quoting James Betker Feedly Summary: I work for OpenAI. […] o4-mini is actually a considerably better vision model than o3, despite the benchmarks. Similar to how o3-mini-high was a much better coding model than o1. I would recommend using o4-mini-high over o3 for any task involving vision.…