Tag: authors

  • Simon Willison’s Weblog: AbsenceBench: Language Models Can’t Tell What’s Missing

    Source URL: https://simonwillison.net/2025/Jun/20/absencebench/#atom-everything Source: Simon Willison’s Weblog Title: AbsenceBench: Language Models Can’t Tell What’s Missing Feedly Summary: AbsenceBench: Language Models Can’t Tell What’s Missing Here’s another interesting result to file under the “jagged frontier" of LLMs, where their strengths and weaknesses are often unintuitive. Long context models have been getting increasingly good at passing "Needle…

  • Slashdot: How Do Olympiad Medalists Judge LLMs in Competitive Programming?

    Source URL: https://slashdot.org/story/25/06/17/149238/how-do-olympiad-medalists-judge-llms-in-competitive-programming?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: How Do Olympiad Medalists Judge LLMs in Competitive Programming? Feedly Summary: AI Summary and Description: Yes Summary: The text discusses a newly established benchmark demonstrating that large language models (LLMs) are not yet capable of outperforming elite human coders, particularly in problem-solving contexts. The findings indicate limitations in the…

  • Slashdot: Meta’s Llama 3.1 Can Recall 42% of the First Harry Potter Book

    Source URL: https://slashdot.org/story/25/06/15/2230206/metas-llama-31-can-recall-42-of-the-first-harry-potter-book?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Meta’s Llama 3.1 Can Recall 42% of the First Harry Potter Book Feedly Summary: AI Summary and Description: Yes Summary: The text discusses significant findings from a research study that highlights the memorization capabilities of Llama 3.1 70B, an AI model from Meta. It raises concerns about potential legal…

  • Slashdot: ‘AI Is Not Intelligent’: The Atlantic Criticizes ‘Scam’ Underlying the AI Industry

    Source URL: https://slashdot.org/story/25/06/09/062257/ai-is-not-intelligent-the-atlantic-criticizes-scam-underlying-the-ai-industry?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: ‘AI Is Not Intelligent’: The Atlantic Criticizes ‘Scam’ Underlying the AI Industry Feedly Summary: AI Summary and Description: Yes Summary: The text critically examines the portrayal of AI, particularly large language models (LLMs), as capable of understanding and emotional intelligence. It highlights misconceptions about AI’s capabilities and warns against…

  • Simon Willison’s Weblog: Tips on prompting ChatGPT for UK technology secretary Peter Kyle

    Source URL: https://simonwillison.net/2025/Jun/3/tips-for-peter-kyle/#atom-everything Source: Simon Willison’s Weblog Title: Tips on prompting ChatGPT for UK technology secretary Peter Kyle Feedly Summary: Back in March New Scientist reported on a successful Freedom of Information request they had filed requesting UK Secretary of State for Science, Innovation and Technology Peter Kyle’s ChatGPT logs: New Scientist has obtained records…

  • Simon Willison’s Weblog: Shisa V2 405B: Japan’s Highest Performing LLM

    Source URL: https://simonwillison.net/2025/Jun/3/shisa-v2/ Source: Simon Willison’s Weblog Title: Shisa V2 405B: Japan’s Highest Performing LLM Feedly Summary: Shisa V2 405B: Japan’s Highest Performing LLM Leonard Lin and Adam Lensenmayer have been working on Shisa for a while. They describe their latest release as “Japan’s Highest Performing LLM". Shisa V2 405B is the highest-performing LLM ever…

  • Slashdot: Football and Other Premium TV Being Pirated At ‘Industrial Scale’

    Source URL: https://yro.slashdot.org/story/25/05/31/0029226/football-and-other-premium-tv-being-pirated-at-industrial-scale?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Football and Other Premium TV Being Pirated At ‘Industrial Scale’ Feedly Summary: AI Summary and Description: Yes Summary: The report highlights the significant shortcomings of major tech firms in preventing the theft of premium video services through devices like the Amazon Fire Stick, which have become enablers of piracy.…

  • Simon Willison’s Weblog: llm-mistral 0.14

    Source URL: https://simonwillison.net/2025/May/29/llm-mistral-014/#atom-everything Source: Simon Willison’s Weblog Title: llm-mistral 0.14 Feedly Summary: llm-mistral 0.14 I added tool-support to my plugin for accessing the Mistral API from LLM today, plus support for Mistral’s new Codestral Embed embedding model. An interesting challenge here is that I’m not using an official client library for llm-mistral – I rolled…