Tag: Tags:

Source URL: https://simonwillison.net/2025/Jul/23/oss-rebuild/ Source: Simon Willison’s Weblog Title: Introducing OSS Rebuild: Open Source, Rebuilt to Last Feedly Summary: Introducing OSS Rebuild: Open Source, Rebuilt to Last Major news on the Reproducible Builds front: the Google Security team have announced OSS Rebuild, their project to provide build attestations for open source packages released through the NPM,…

Simon Willison’s Weblog: TimeScope: How Long Can Your Video Large Multimodal Model Go?

Jul 23, 2025

—

by

Source URL: https://simonwillison.net/2025/Jul/23/timescope/#atom-everything Source: Simon Willison’s Weblog Title: TimeScope: How Long Can Your Video Large Multimodal Model Go? Feedly Summary: TimeScope: How Long Can Your Video Large Multimodal Model Go? New open source benchmark for evaluating vision LLMs on how well they handle long videos: TimeScope probes the limits of long-video capabilities by inserting several…

Simon Willison’s Weblog: Quoting ICML 2025

Jul 23, 2025

—

by

Source URL: https://simonwillison.net/2025/Jul/23/icml-2025/#atom-everything Source: Simon Willison’s Weblog Title: Quoting ICML 2025 Feedly Summary: Submitting a paper with a “hidden" prompt is scientific misconduct if that prompt is intended to obtain a favorable review from an LLM. The inclusion of such a prompt is an attempt to subvert the peer-review process. Although ICML 2025 reviewers are…

Simon Willison’s Weblog: Qwen3-Coder: Agentic Coding in the World

—

by

Source URL: https://simonwillison.net/2025/Jul/22/qwen3-coder/ Source: Simon Willison’s Weblog Title: Qwen3-Coder: Agentic Coding in the World Feedly Summary: Qwen3-Coder: Agentic Coding in the World It turns out that as I was typing up my notes on Qwen3-235B-A22B-Instruct-2507 the Qwen team were unleashing something much bigger: Today, we’re announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder…

Simon Willison’s Weblog: Qwen/Qwen3-235B-A22B-Instruct-2507

—

by

Source URL: https://simonwillison.net/2025/Jul/22/qwen3-235b-a22b-instruct-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen/Qwen3-235B-A22B-Instruct-2507 Feedly Summary: Qwen/Qwen3-235B-A22B-Instruct-2507 Significant new model release from Qwen, published yesterday without much fanfare. This is a follow-up to their April release of the full Qwen 3 model family, which included a Qwen3-235B-A22B model which could handle both reasoning and non-reasoning prompts (via a /no_think toggle).…

Simon Willison’s Weblog: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data

—

by

Source URL: https://simonwillison.net/2025/Jul/22/subliminal-learning/ Source: Simon Willison’s Weblog Title: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data Feedly Summary: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data This new alignment paper from Anthropic wins my prize for best illustrative figure so far this year: The researchers found that…

Simon Willison’s Weblog: Our contribution to a global environmental standard for AI

—

by

Source URL: https://simonwillison.net/2025/Jul/22/mistral-environmental-standard/ Source: Simon Willison’s Weblog Title: Our contribution to a global environmental standard for AI Feedly Summary: Our contribution to a global environmental standard for AI Mistral have released environmental impact numbers for their largest model, Mistral Large 2, in more detail than I have seen from any of the other large AI…

Simon Willison’s Weblog: Gemini 2.5 Flash-Lite is now stable and generally available

—

by

Source URL: https://simonwillison.net/2025/Jul/22/gemini-25-flash-lite/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5 Flash-Lite is now stable and generally available Feedly Summary: Gemini 2.5 Flash-Lite is now stable and generally available The last remaining member of the Gemini 2.5 trio joins Pro and Flash in General Availability today. Gemini 2.5 Flash-Lite is the cheapest of the 2.5 family,…

Simon Willison’s Weblog: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad

Jul 21, 2025

—

by