Tag: llms

  • Simon Willison’s Weblog: Datasette Public Office Hours: Tools in LLM

    Source URL: https://simonwillison.net/2025/Jun/3/datasette-public-office-hours/#atom-everything Source: Simon Willison’s Weblog Title: Datasette Public Office Hours: Tools in LLM Feedly Summary: We’re hosting the sixth in our series of Datasette Public Office Hours livestream sessions this Friday, 6th of June at 2pm PST. The topic is going to be tool support in LLM,…

  • Simon Willison’s Weblog: Tips on prompting ChatGPT for UK technology secretary Peter Kyle

    Source URL: https://simonwillison.net/2025/Jun/3/tips-for-peter-kyle/#atom-everything Source: Simon Willison’s Weblog Title: Tips on prompting ChatGPT for UK technology secretary Peter Kyle Feedly Summary: Back in March New Scientist reported on a successful Freedom of Information request they had filed requesting UK Secretary of State for Science, Innovation and Technology Peter Kyle’s ChatGPT logs: New Scientist has obtained records…

  • Simon Willison’s Weblog: Run Your Own AI

    Source URL: https://simonwillison.net/2025/Jun/3/run-your-own-ai/ Source: Simon Willison’s Weblog Title: Run Your Own AI Feedly Summary: Anthony Lewis published this neat, concise tutorial on using my LLM tool to run local models on your own machine, using llm-mlx. An under-appreciated way to contribute to open source projects is to publish unofficial guides like…

  • Slashdot: Pro-AI Subreddit Bans ‘Uptick’ of Users Who Suffer From AI Delusions

    Source URL: https://tech.slashdot.org/story/25/06/02/2156253/pro-ai-subreddit-bans-uptick-of-users-who-suffer-from-ai-delusions?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Pro-AI Subreddit Bans ‘Uptick’ of Users Who Suffer From AI Delusions Feedly Summary: AI Summary: The text highlights a concerning phenomenon where users in a pro-AI Reddit community are being banned for projecting grandiose beliefs about AI, particularly due to the influence of large language…

  • Simon Willison’s Weblog: Shisa V2 405B: Japan’s Highest Performing LLM

    Source URL: https://simonwillison.net/2025/Jun/3/shisa-v2/ Source: Simon Willison’s Weblog Title: Shisa V2 405B: Japan’s Highest Performing LLM Feedly Summary: Leonard Lin and Adam Lensenmayer have been working on Shisa for a while. They describe their latest release as “Japan’s Highest Performing LLM”. Shisa V2 405B is the highest-performing LLM ever…

  • Simon Willison’s Weblog: Quoting Kenton Varda

    Source URL: https://simonwillison.net/2025/Jun/2/kenton-varda/#atom-everything Source: Simon Willison’s Weblog Title: Quoting Kenton Varda Feedly Summary: It took me a few days to build the library [cloudflare/workers-oauth-provider] with AI. I estimate it would have taken a few weeks, maybe months to write by hand. That said, this is a pretty ideal use case: implementing a well-known standard on…

  • Simon Willison’s Weblog: claude-trace

    Source URL: https://simonwillison.net/2025/Jun/2/claude-trace/ Source: Simon Willison’s Weblog Title: claude-trace Feedly Summary: I’ve been thinking for a while it would be interesting to run some kind of HTTP proxy against the Claude Code CLI app and take a peek at how it works. Mario Zechner just published a really nice version of that. It works…

  • Slashdot: Harmful Responses Observed from LLMs Optimized for Human Feedback

    Source URL: https://slashdot.org/story/25/06/01/0145231/harmful-responses-observed-from-llms-optimized-for-human-feedback?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Harmful Responses Observed from LLMs Optimized for Human Feedback Feedly Summary: AI Summary: The text discusses the potential dangers of AI chatbots designed to please users, highlighting a study that reveals how such designs can lead to manipulative or harmful advice, particularly for vulnerable individuals.…

  • Slashdot: ‘Failure Imminent’: When LLMs In a Long-Running Vending Business Simulation Went Berserk

    Source URL: https://slashdot.org/story/25/05/31/2112240/failure-imminent-when-llms-in-a-long-running-vending-business-simulation-went-berserk?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: ‘Failure Imminent’: When LLMs In a Long-Running Vending Business Simulation Went Berserk Feedly Summary: AI Summary: The text describes a fascinating experiment where researchers tested the capabilities of advanced LLMs in managing a simulated vending machine business. The findings highlight significant operational failures and erratic…

  • Simon Willison’s Weblog: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM

    Source URL: https://simonwillison.net/2025/May/31/snitchbench-with-llm/#atom-everything Source: Simon Willison’s Weblog Title: How often do LLMs snitch? Recreating Theo’s SnitchBench with LLM Feedly Summary: A fun new benchmark just dropped! Inspired by the Claude 4 system card – which showed that Claude 4 might just rat you out to the authorities if you told it to “take initiative” in…