Tag: Tags:
-
Simon Willison’s Weblog: Using GitHub Spark to reverse engineer GitHub Spark
Source URL: https://simonwillison.net/2025/Jul/24/github-spark/ Source: Simon Willison’s Weblog Title: Using GitHub Spark to reverse engineer GitHub Spark Feedly Summary: GitHub Spark was released in public preview yesterday. It’s GitHub’s implementation of the prompt-to-app pattern also seen in products like Claude Artifacts, Lovable, Vercel v0, Val Town Townie and Fly.io’s Phoenix New. I wrote about Spark back…
-
Simon Willison’s Weblog: Introducing OSS Rebuild: Open Source, Rebuilt to Last
Source URL: https://simonwillison.net/2025/Jul/23/oss-rebuild/ Source: Simon Willison’s Weblog Title: Introducing OSS Rebuild: Open Source, Rebuilt to Last Feedly Summary: Introducing OSS Rebuild: Open Source, Rebuilt to Last Major news on the Reproducible Builds front: the Google Security team have announced OSS Rebuild, their project to provide build attestations for open source packages released through the NPM,…
-
Simon Willison’s Weblog: TimeScope: How Long Can Your Video Large Multimodal Model Go?
Source URL: https://simonwillison.net/2025/Jul/23/timescope/#atom-everything Source: Simon Willison’s Weblog Title: TimeScope: How Long Can Your Video Large Multimodal Model Go? Feedly Summary: TimeScope: How Long Can Your Video Large Multimodal Model Go? New open source benchmark for evaluating vision LLMs on how well they handle long videos: TimeScope probes the limits of long-video capabilities by inserting several…
-
Simon Willison’s Weblog: Quoting ICML 2025
Source URL: https://simonwillison.net/2025/Jul/23/icml-2025/#atom-everything Source: Simon Willison’s Weblog Title: Quoting ICML 2025 Feedly Summary: Submitting a paper with a “hidden" prompt is scientific misconduct if that prompt is intended to obtain a favorable review from an LLM. The inclusion of such a prompt is an attempt to subvert the peer-review process. Although ICML 2025 reviewers are…
-
Simon Willison’s Weblog: Qwen3-Coder: Agentic Coding in the World
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-coder/ Source: Simon Willison’s Weblog Title: Qwen3-Coder: Agentic Coding in the World Feedly Summary: Qwen3-Coder: Agentic Coding in the World It turns out that as I was typing up my notes on Qwen3-235B-A22B-Instruct-2507 the Qwen team were unleashing something much bigger: Today, we’re announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder…
-
Simon Willison’s Weblog: Qwen/Qwen3-235B-A22B-Instruct-2507
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-235b-a22b-instruct-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen/Qwen3-235B-A22B-Instruct-2507 Feedly Summary: Qwen/Qwen3-235B-A22B-Instruct-2507 Significant new model release from Qwen, published yesterday without much fanfare. This is a follow-up to their April release of the full Qwen 3 model family, which included a Qwen3-235B-A22B model which could handle both reasoning and non-reasoning prompts (via a /no_think toggle).…
-
Simon Willison’s Weblog: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data
Source URL: https://simonwillison.net/2025/Jul/22/subliminal-learning/ Source: Simon Willison’s Weblog Title: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data Feedly Summary: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data This new alignment paper from Anthropic wins my prize for best illustrative figure so far this year: The researchers found that…
-
Simon Willison’s Weblog: Our contribution to a global environmental standard for AI
Source URL: https://simonwillison.net/2025/Jul/22/mistral-environmental-standard/ Source: Simon Willison’s Weblog Title: Our contribution to a global environmental standard for AI Feedly Summary: Our contribution to a global environmental standard for AI Mistral have released environmental impact numbers for their largest model, Mistral Large 2, in more detail than I have seen from any of the other large AI…
-
Simon Willison’s Weblog: Gemini 2.5 Flash-Lite is now stable and generally available
Source URL: https://simonwillison.net/2025/Jul/22/gemini-25-flash-lite/#atom-everything Source: Simon Willison’s Weblog Title: Gemini 2.5 Flash-Lite is now stable and generally available Feedly Summary: Gemini 2.5 Flash-Lite is now stable and generally available The last remaining member of the Gemini 2.5 trio joins Pro and Flash in General Availability today. Gemini 2.5 Flash-Lite is the cheapest of the 2.5 family,…
-
Simon Willison’s Weblog: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad
Source URL: https://simonwillison.net/2025/Jul/21/gemini-imo/#atom-everything Source: Simon Willison’s Weblog Title: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad Feedly Summary: Advanced version of Gemini with Deep Think officially achieves gold-medal standard at the International Mathematical Olympiad OpenAI beat them to the punch in terms of publicity by publishing their…