Tag: .NET

  • Simon Willison’s Weblog: AI Hallucination Cases

    Source URL: https://simonwillison.net/2025/May/25/ai-hallucination-cases/#atom-everything Source: Simon Willison’s Weblog Title: AI Hallucination Cases Feedly Summary: AI Hallucination Cases Damien Charlotin maintains this database of cases around the world where a legal decision has been made that confirms hallucinated content from generative AI was presented by a lawyer. That’s an important distinction: this isn’t just cases where AI…

  • Simon Willison’s Weblog: System Card: Claude Opus 4 & Claude Sonnet 4

    Source URL: https://simonwillison.net/2025/May/25/claude-4-system-card/#atom-everything Source: Simon Willison’s Weblog Title: System Card: Claude Opus 4 & Claude Sonnet 4 Feedly Summary: System Card: Claude Opus 4 & Claude Sonnet 4 Direct link to a PDF on Anthropic’s CDN because they don’t appear to have a landing page anywhere for this document. Anthropic’s system cards are always worth…

  • Simon Willison’s Weblog: Quoting Sean Heelan

    Source URL: https://simonwillison.net/2025/May/24/sean-heelan/ Source: Simon Willison’s Weblog Title: Quoting Sean Heelan Feedly Summary: The vulnerability [o3] found is CVE-2025-37899 (fix here), a use-after-free in the handler for the SMB ‘logoff’ command. Understanding the vulnerability requires reasoning about concurrent connections to the server, and how they may share various objects in specific circumstances. o3 was able…

  • Simon Willison’s Weblog: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft

    Source URL: https://simonwillison.net/2025/May/23/remote-prompt-injection-in-gitlab-duo/ Source: Simon Willison’s Weblog Title: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Feedly Summary: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Yet another example of the classic Markdown image exfiltration attack, this time affecting GitLab Duo – GitLab’s chatbot. Omer Mayraz reports on how…

  • Simon Willison’s Weblog: Updated Anthropic model comparison table

    Source URL: https://simonwillison.net/2025/May/22/updated-anthropic-models/#atom-everything Source: Simon Willison’s Weblog Title: Updated Anthropic model comparison table Feedly Summary: Updated Anthropic model comparison table A few details in here about Claude 4 that I hadn’t spotted elsewhere: The training cut-off date for Claude Opus 4 and Claude Sonnet 4 is March 2025! That’s the most recent cut-off for any…

  • Simon Willison’s Weblog: Agents are models using tools in a loop

    Source URL: https://simonwillison.net/2025/May/22/tools-in-a-loop/#atom-everything Source: Simon Willison’s Weblog Title: Agents are models using tools in a loop Feedly Summary: I was going slightly spare at the fact that every talk at this Anthropic developer conference has used the word “agents" dozens of times, but nobody ever stopped to provide a useful definition. I’m now in the…

  • Simon Willison’s Weblog: llm-anthropic 0.16

    Source URL: https://simonwillison.net/2025/May/22/llm-anthropic-016/#atom-everything Source: Simon Willison’s Weblog Title: llm-anthropic 0.16 Feedly Summary: llm-anthropic 0.16 New release of my LLM plugin for Anthropic adding the new Claude 4 Opus and Sonnet models. You can see pelicans on bicycles generated using the new plugin at the bottom of my live blog covering the release. I also released…

  • Simon Willison’s Weblog: Live blog: Claude 4 launch at Code with Claude

    Source URL: https://simonwillison.net/2025/May/22/code-with-claude-live-blog/ Source: Simon Willison’s Weblog Title: Live blog: Claude 4 launch at Code with Claude Feedly Summary: I’m at Anthropic’s Code with Claude event, where they are launching Claude 4. I’ll be live blogging the keynote here. Tags: llm-release, liveblogging, anthropic, claude, generative-ai, ai, llms AI Summary and Description: Yes Summary: The text…

  • Simon Willison’s Weblog: Devstral

    Source URL: https://simonwillison.net/2025/May/21/devstral/#atom-everything Source: Simon Willison’s Weblog Title: Devstral Feedly Summary: Devstral New Apache 2.0 licensed LLM release from Mistral, this time specifically trained for code. Devstral achieves a score of 46.8% on SWE-Bench Verified, outperforming prior open-source SoTA models by more than 6% points. When evaluated under the same test scaffold (OpenHands, provided by…