Tag: Tags:
-
Simon Willison’s Weblog: Coding with LLMs in the summer of 2025 (an update)
Source URL: https://simonwillison.net/2025/Jul/21/coding-with-llms/#atom-everything Source: Simon Willison’s Weblog Title: Coding with LLMs in the summer of 2025 (an update) Feedly Summary: Coding with LLMs in the summer of 2025 (an update) Salvatore Sanfilippo describes his current AI-assisted development workflow. He’s all-in on LLMs for code review, exploratory prototyping, pair-design and writing “part of the code under…
-
Simon Willison’s Weblog: OpenAI’s gold medal performance on the International Math Olympiad
Source URL: https://simonwillison.net/2025/Jul/19/openai-gold-medal-math-olympiad/#atom-everything Source: Simon Willison’s Weblog Title: OpenAI’s gold medal performance on the International Math Olympiad Feedly Summary: OpenAI’s gold medal performance on the International Math Olympiad OpenAI research scientist Alexander Wei: I’m excited to share that our latest @OpenAI experimental reasoning LLM has achieved a longstanding grand challenge in AI: gold medal-level performance…
-
Simon Willison’s Weblog: Quoting Steve Yegge
Source URL: https://simonwillison.net/2025/Jul/19/steve-yegge/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Steve Yegge Feedly Summary: So one of my favorite things to do is give my coding agents more and more permissions and freedom, just to see how far I can push their productivity without going too far off the rails. It’s a delicate balance. I haven’t…
-
Simon Willison’s Weblog: Quoting Paul Kedrosky
Source URL: https://simonwillison.net/2025/Jul/19/paul-kedrosky/ Source: Simon Willison’s Weblog Title: Quoting Paul Kedrosky Feedly Summary: One analyst recently speculated (via Ed Conard) that, based on Nvidia’s latest datacenter sales figures, AI capex may be ~2% of US GDP in 2025, given a standard multiplier. […] Capital expenditures on AI data centers is likely around 20% of the…
-
Simon Willison’s Weblog: How to run an LLM on your laptop
Source URL: https://simonwillison.net/2025/Jul/18/how-to-run-an-llm-on-your-laptop/ Source: Simon Willison’s Weblog Title: How to run an LLM on your laptop Feedly Summary: How to run an LLM on your laptop I talked to Grace Huckins for this piece from MIT Technology Review on running local models. Apparently she enjoyed my dystopian backup plan! Simon Willison has a plan for…
-
Simon Willison’s Weblog: Voxtral
Source URL: https://simonwillison.net/2025/Jul/16/voxtral/#atom-everything Source: Simon Willison’s Weblog Title: Voxtral Feedly Summary: Voxtral Mistral released their first audio-input models yesterday: Voxtral Small and Voxtral Mini. These state‑of‑the‑art speech understanding models are available in two sizes—a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache…
-
Simon Willison’s Weblog: xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated"
Source URL: https://simonwillison.net/2025/Jul/15/xai-mitigated/ Source: Simon Willison’s Weblog Title: xAI: "We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated" Feedly Summary: xAI: “We spotted a couple of issues with Grok 4 recently that we immediately investigated & mitigated" They continue: One was that if you ask it "What is…
-
Simon Willison’s Weblog: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity
Source URL: https://simonwillison.net/2025/Jul/12/ai-open-source-productivity/#atom-everything Source: Simon Willison’s Weblog Title: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity Feedly Summary: Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity METR – for Model Evaluation & Threat Research – are a non-profit research institute founded by Beth Barnes, a former alignment researcher at…
-
Simon Willison’s Weblog: Grok 4 Heavy won’t reveal its system prompt
Source URL: https://simonwillison.net/2025/Jul/12/grok-4-heavy/#atom-everything Source: Simon Willison’s Weblog Title: Grok 4 Heavy won’t reveal its system prompt Feedly Summary: Grok 4 Heavy won’t reveal its system prompt Grok 4 Heavy is the “think much harder" version of Grok 4 that’s currenly only available on their $300/month plan. Jeremy Howard relays a report from a Grok 4…
-
Simon Willison’s Weblog: Quoting @grok
Source URL: https://simonwillison.net/2025/Jul/12/grok/#atom-everything Source: Simon Willison’s Weblog Title: Quoting @grok Feedly Summary: On the morning of July 8, 2025, we observed undesired responses and immediately began investigating. To identify the specific language in the instructions causing the undesired behavior, we conducted multiple ablations and experiments to pinpoint the main culprits. We identified the operative lines…