Tag: llm
-
Simon Willison’s Weblog: Using GitHub Spark to reverse engineer GitHub Spark
Source URL: https://simonwillison.net/2025/Jul/24/github-spark/ Source: Simon Willison’s Weblog Title: Using GitHub Spark to reverse engineer GitHub Spark Feedly Summary: GitHub Spark was released in public preview yesterday. It’s GitHub’s implementation of the prompt-to-app pattern also seen in products like Claude Artifacts, Lovable, Vercel v0, Val Town Townie and Fly.io’s Phoenix New. I wrote about Spark back…
-
Docker: Docker MCP Catalog: Finding the Right AI Tools for Your Project
Source URL: https://www.docker.com/blog/finding-the-right-ai-developer-tools-mcp-catalog/ Source: Docker Title: Docker MCP Catalog: Finding the Right AI Tools for Your Project Feedly Summary: As large language models (LLMs) evolve from static text generators to dynamic agents capable of executing actions, there’s a growing need for a standardized way to let them interact with external tooling securely. That’s where Model…
-
Simon Willison’s Weblog: Introducing OSS Rebuild: Open Source, Rebuilt to Last
Source URL: https://simonwillison.net/2025/Jul/23/oss-rebuild/ Source: Simon Willison’s Weblog Title: Introducing OSS Rebuild: Open Source, Rebuilt to Last Feedly Summary: Introducing OSS Rebuild: Open Source, Rebuilt to Last Major news on the Reproducible Builds front: the Google Security team have announced OSS Rebuild, their project to provide build attestations for open source packages released through the NPM,…
-
Simon Willison’s Weblog: TimeScope: How Long Can Your Video Large Multimodal Model Go?
Source URL: https://simonwillison.net/2025/Jul/23/timescope/#atom-everything Source: Simon Willison’s Weblog Title: TimeScope: How Long Can Your Video Large Multimodal Model Go? Feedly Summary: TimeScope: How Long Can Your Video Large Multimodal Model Go? New open source benchmark for evaluating vision LLMs on how well they handle long videos: TimeScope probes the limits of long-video capabilities by inserting several…
-
Simon Willison’s Weblog: Quoting ICML 2025
Source URL: https://simonwillison.net/2025/Jul/23/icml-2025/#atom-everything Source: Simon Willison’s Weblog Title: Quoting ICML 2025 Feedly Summary: Submitting a paper with a “hidden" prompt is scientific misconduct if that prompt is intended to obtain a favorable review from an LLM. The inclusion of such a prompt is an attempt to subvert the peer-review process. Although ICML 2025 reviewers are…
-
Cloud Blog: From Help Desk to Hypervisor: Defending Your VMware vSphere Estate from UNC3944
Source URL: https://cloud.google.com/blog/topics/threat-intelligence/defending-vsphere-from-unc3944/ Source: Cloud Blog Title: From Help Desk to Hypervisor: Defending Your VMware vSphere Estate from UNC3944 Feedly Summary: Introduction In mid 2025, Google Threat Intelligence Group (GITG) identified a sophisticated and aggressive cyber campaign targeting multiple industries, including retail, airline, and insurance. This was the work of UNC3944, a financially motivated threat…
-
CSA: What to Expect in the ISO 42001 Certification Process
Source URL: https://www.schellman.com/blog/iso-certifications/iso-42001-certification-processs Source: CSA Title: What to Expect in the ISO 42001 Certification Process Feedly Summary: AI Summary and Description: Yes **Summary:** The text discusses ISO 42001, the first international standard for responsible AI management, detailing its importance and the certification process. The standard is crucial for organizations involved in AI, ensuring ethical governance…
-
Simon Willison’s Weblog: Qwen3-Coder: Agentic Coding in the World
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-coder/ Source: Simon Willison’s Weblog Title: Qwen3-Coder: Agentic Coding in the World Feedly Summary: Qwen3-Coder: Agentic Coding in the World It turns out that as I was typing up my notes on Qwen3-235B-A22B-Instruct-2507 the Qwen team were unleashing something much bigger: Today, we’re announcing Qwen3-Coder, our most agentic code model to date. Qwen3-Coder…
-
Simon Willison’s Weblog: Qwen/Qwen3-235B-A22B-Instruct-2507
Source URL: https://simonwillison.net/2025/Jul/22/qwen3-235b-a22b-instruct-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen/Qwen3-235B-A22B-Instruct-2507 Feedly Summary: Qwen/Qwen3-235B-A22B-Instruct-2507 Significant new model release from Qwen, published yesterday without much fanfare. This is a follow-up to their April release of the full Qwen 3 model family, which included a Qwen3-235B-A22B model which could handle both reasoning and non-reasoning prompts (via a /no_think toggle).…
-
Simon Willison’s Weblog: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data
Source URL: https://simonwillison.net/2025/Jul/22/subliminal-learning/ Source: Simon Willison’s Weblog Title: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data Feedly Summary: Subliminal Learning: Language Models Transmit Behavioral Traits via Hidden Signals in Data This new alignment paper from Anthropic wins my prize for best illustrative figure so far this year: The researchers found that…