Tag: Well

  • Simon Willison’s Weblog: Quoting Kenton Varda

    Source URL: https://simonwillison.net/2025/Jun/2/kenton-varda/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Kenton Varda Feedly Summary: It took me a few days to build the library [cloudflare/workers-oauth-provider] with AI. I estimate it would have taken a few weeks, maybe months to write by hand. That said, this is a pretty ideal use case: implementing a well-known standard on…

  • Simon Willison’s Weblog: claude-trace

    Source URL: https://simonwillison.net/2025/Jun/2/claude-trace/ Source: Simon Willison’s Weblog Title: claude-trace Feedly Summary: claude-trace I’ve been thinking for a while it would be interesting to run some kind of HTTP proxy against the Claude Code CLI app and take a peek at how it works. Mario Zechner just published a really nice version of that. It works…

  • Slashdot: ‘Failure Imminent’: When LLMs In a Long-Running Vending Business Simulation Went Berserk

    Source URL: https://slashdot.org/story/25/05/31/2112240/failure-imminent-when-llms-in-a-long-running-vending-business-simulation-went-berserk?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: ‘Failure Imminent’: When LLMs In a Long-Running Vending Business Simulation Went Berserk Feedly Summary: AI Summary and Description: Yes Summary: The text describes a fascinating experiment where researchers tested the capabilities of advanced LLMs in managing a simulated vending machine business. The findings highlight significant operational failures and erratic…

  • Simon Willison’s Weblog: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM

    Source URL: https://simonwillison.net/2025/May/31/snitchbench-with-llm/#atom-everything Source: Simon Willison’s Weblog Title: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM Feedly Summary: A fun new benchmark just dropped! Inspired by the Claude 4 system card – which showed that Claude 4 might just rat you out to the authorities if you told it to “take initiative" in…

  • Simon Willison’s Weblog: deepseek-ai/DeepSeek-R1-0528

    Source URL: https://simonwillison.net/2025/May/31/deepseek-aideepseek-r1-0528/ Source: Simon Willison’s Weblog Title: deepseek-ai/DeepSeek-R1-0528 Feedly Summary: deepseek-ai/DeepSeek-R1-0528 Sadly the trend for terrible naming of models has infested the Chinese AI labs as well. DeepSeek-R1-0528 is a brand new and much improved open weights reasoning model from DeepSeek, a major step up from the DeepSeek R1 they released back in January.…

  • The Register: ConnectWise customers get mysterious warning about ‘sophisticated’ nation-state hack

    Source URL: https://www.theregister.com/2025/05/30/connectwise_compromised_by_sophisticated_government/ Source: The Register Title: ConnectWise customers get mysterious warning about ‘sophisticated’ nation-state hack Feedly Summary: Pen tester on ScreenConnect bug: This one ‘terrifies’ me ConnectWise has brought in the big guns to investigate a “sophisticated nation state actor" that broke into its IT environment and then breached some of its customers.… AI…