Tag: llms
-
Simon Willison’s Weblog: AI-assisted development needs automated tests
Source URL: https://simonwillison.net/2025/May/28/automated-tests/ Source: Simon Willison’s Weblog Title: AI-assisted development needs automated tests Feedly Summary: I wonder if one of the reasons I’m finding LLMs so much more useful for coding than a lot of people that I see in online discussions is that effectively all of the code I work on has automated tests.…
-
Simon Willison’s Weblog: Large Language Models can run tools in your terminal with LLM 0.26
Source URL: https://simonwillison.net/2025/May/27/llm-tools/ Source: Simon Willison’s Weblog Title: Large Language Models can run tools in your terminal with LLM 0.26 Feedly Summary: LLM 0.26 is out with the biggest new feature since I started the project: support for tools. You can now use the LLM CLI tool – and Python library – to grant LLMs…
-
Simon Willison’s Weblog: Build AI agents with the Mistral Agents API
Source URL: https://simonwillison.net/2025/May/27/mistral-agents-api/ Source: Simon Willison’s Weblog Title: Build AI agents with the Mistral Agents API Feedly Summary: Big upgrade to Mistral’s API this morning: they’ve announced a new “Agents API”. Mistral have been using the term “agents” for a while now. Here’s how they describe them:…
-
Scott Logic: Read the books! Should junior developers use LLMs?
Source URL: https://blog.scottlogic.com/2025/05/27/read-the-books-should-junior-developers-use-llms.html Source: Scott Logic Title: Read the books! Should junior developers use LLMs? Feedly Summary: Large Language Models are powerful tools that can greatly enhance software developers’ productivity, but for junior developers starting a career in tech, they may hinder long-term growth by abstracting away essential programming fundamentals.
-
Simon Willison’s Weblog: GitHub MCP Exploited: Accessing private repositories via MCP
Source URL: https://simonwillison.net/2025/May/26/github-mcp-exploited/#atom-everything Source: Simon Willison’s Weblog Title: GitHub MCP Exploited: Accessing private repositories via MCP Feedly Summary: GitHub’s official MCP server grants LLMs a whole host of new abilities, including being able to read issues in repositories the user has access to and submit new…
-
Simon Willison’s Weblog: AI Hallucination Cases
Source URL: https://simonwillison.net/2025/May/25/ai-hallucination-cases/#atom-everything Source: Simon Willison’s Weblog Title: AI Hallucination Cases Feedly Summary: Damien Charlotin maintains this database of cases around the world where a legal decision has been made that confirms hallucinated content from generative AI was presented by a lawyer. That’s an important distinction: this isn’t just cases where AI…
-
Simon Willison’s Weblog: Highlights from the Claude 4 system prompt
Source URL: https://simonwillison.net/2025/May/25/claude-4-system-prompt/ Source: Simon Willison’s Weblog Title: Highlights from the Claude 4 system prompt Feedly Summary: Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude Sonnet 4. I enjoyed digging through the prompts,…
-
Simon Willison’s Weblog: System Card: Claude Opus 4 & Claude Sonnet 4
Source URL: https://simonwillison.net/2025/May/25/claude-4-system-card/#atom-everything Source: Simon Willison’s Weblog Title: System Card: Claude Opus 4 & Claude Sonnet 4 Feedly Summary: Direct link to a PDF on Anthropic’s CDN because they don’t appear to have a landing page anywhere for this document. Anthropic’s system cards are always worth…
-
Simon Willison’s Weblog: Quoting Sean Heelan
Source URL: https://simonwillison.net/2025/May/24/sean-heelan/ Source: Simon Willison’s Weblog Title: Quoting Sean Heelan Feedly Summary: The vulnerability [o3] found is CVE-2025-37899 (fix here), a use-after-free in the handler for the SMB ‘logoff’ command. Understanding the vulnerability requires reasoning about concurrent connections to the server, and how they may share various objects in specific circumstances. o3 was able…
-
Slashdot: People Should Know About the ‘Beliefs’ LLMs Form About Them While Conversing
Source URL: https://slashdot.org/story/25/05/24/1946203/people-should-know-about-the-beliefs-llms-form-about-them-while-conversing Source: Slashdot Title: People Should Know About the ‘Beliefs’ LLMs Form About Them While Conversing Feedly Summary: The text discusses the implications of using large language models (LLMs) like Llama that exhibit human-like biases based on user interactions. This raises critical policy and ethical issues related…