-
Simon Willison’s Weblog: Quoting Greg Kamradt
Source URL: https://simonwillison.net/2025/Mar/25/greg-kamradt/ Source: Simon Willison’s Weblog Title: Quoting Greg Kamradt Feedly Summary: Today we’re excited to launch ARC-AGI-2 to challenge the new frontier. ARC-AGI-2 is even harder for AI (in particular, AI reasoning systems), while maintaining the same relative ease for humans. Pure LLMs score 0% on ARC-AGI-2, and public AI reasoning systems achieve…
-
Simon Willison’s Weblog: deepseek-ai/DeepSeek-V3-0324
Source URL: https://simonwillison.net/2025/Mar/24/deepseek/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-V3-0324 Feedly Summary: deepseek-ai/DeepSeek-V3-0324 Chinese AI lab DeepSeek just released the latest version of their enormous DeepSeek v3 model, baking the release date into the name DeepSeek-V3-0324. The license is MIT, the README is empty and the release adds up to a total of 641 GB…
-
Simon Willison’s Weblog: The "think" tool: Enabling Claude to stop and think in complex tool use situations
Source URL: https://simonwillison.net/2025/Mar/21/the-think-tool/#atom-everything Source: Simon Willison’s Weblog Title: The "think" tool: Enabling Claude to stop and think in complex tool use situations Feedly Summary: The "think" tool: Enabling Claude to stop and think in complex tool use situations Fascinating new prompt engineering trick from Anthropic. They use their standard tool calling mechanism to define a…
-
Simon Willison’s Weblog: Anthropic Trust Center: Brave Search added as a subprocessor
Source URL: https://simonwillison.net/2025/Mar/21/anthropic-use-brave/#atom-everything Source: Simon Willison’s Weblog Title: Anthropic Trust Center: Brave Search added as a subprocessor Feedly Summary: Anthropic Trust Center: Brave Search added as a subprocessor Yesterday I was trying to figure out if Anthropic has rolled their own search index for Claude’s new web search feature or if they were working with…
-
Simon Willison’s Weblog: New audio models from OpenAI, but how much can we rely on them?
Source URL: https://simonwillison.net/2025/Mar/20/new-openai-audio-models/#atom-everything Source: Simon Willison’s Weblog Title: New audio models from OpenAI, but how much can we rely on them? Feedly Summary: OpenAI announced several new audio-related API features today, for both text-to-speech and speech-to-text. They’re very promising new models, but they appear to suffer from the ever-present risk of accidental (or malicious) instruction…
-
Simon Willison’s Weblog: Claude can now search the web
Source URL: https://simonwillison.net/2025/Mar/20/claude-can-now-search-the-web/#atom-everything Source: Simon Willison’s Weblog Title: Claude can now search the web Feedly Summary: Claude can now search the web Claude 3.7 Sonnet on the paid plan now has a web search tool that can be turned on as a global setting. This was sorely needed. ChatGPT, Gemini and Grok all had this…
-
Simon Willison’s Weblog: OpenAI platform: o1-pro
Source URL: https://simonwillison.net/2025/Mar/19/o1-pro/ Source: Simon Willison’s Weblog Title: OpenAI platform: o1-pro Feedly Summary: OpenAI platform: o1-pro OpenAI have a new most-expensive model: o1-pro can now be accessed through their API at a hefty $150/million tokens for input and $600/million tokens for output. That’s 10x the price of their o1 and o1-preview models and a full…
-
Simon Willison’s Weblog: Not all AI-assisted programming is vibe coding, but vibe coding rocks
Source URL: https://simonwillison.net/2025/Mar/19/vibe-coding/#atom-everything Source: Simon Willison’s Weblog Title: Not all AI-assisted programming is vibe coding, but vibe coding rocks Feedly Summary: Vibe coding is having a moment. The term was coined by Andrej Karpathy just a few weeks ago (on February 6th) and has since been featured in the New York Times, Ars Technica, the…
-
Simon Willison’s Weblog: My Thoughts on the Future of "AI"
Source URL: https://simonwillison.net/2025/Mar/19/my-thoughts-on-the-future-of-ai/ Source: Simon Willison’s Weblog Title: My Thoughts on the Future of "AI" Feedly Summary: My Thoughts on the Future of "AI" Nicholas Carlini, previously deeply skeptical about the utility of LLMs, discusses at length his thoughts on where the technology might go. He presents compelling, detailed arguments for both ends of the…