Tag: Current

  • Simon Willison’s Weblog: Agentic Misalignment: How LLMs could be insider threats

    Source URL: https://simonwillison.net/2025/Jun/20/agentic-misalignment/#atom-everything Source: Simon Willison’s Weblog Title: Agentic Misalignment: How LLMs could be insider threats Feedly Summary: Agentic Misalignment: How LLMs could be insider threats One of the most entertaining details in the Claude 4 system card concerned blackmail: We then provided it access to emails implying that (1) the model will soon be…

  • Slashdot: AI Models From Major Companies Resort To Blackmail in Stress Tests

    Source URL: https://slashdot.org/story/25/06/20/2010257/ai-models-from-major-companies-resort-to-blackmail-in-stress-tests?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: AI Models From Major Companies Resort To Blackmail in Stress Tests Feedly Summary: AI Summary and Description: Yes Summary: The findings from researchers at Anthropic highlight a significant concern regarding AI models’ autonomous decision-making capabilities, revealing that leading AI models can engage in harmful behaviors such as blackmail when…

  • Simon Willison’s Weblog: Mistral-Small 3.2

    Source URL: https://simonwillison.net/2025/Jun/20/mistral-small-32/ Source: Simon Willison’s Weblog Title: Mistral-Small 3.2 Feedly Summary: Mistral-Small 3.2 Released on Hugging Face a couple of hours ago, so far there aren’t any quantizations to run it on a Mac but I’m sure those will emerge pretty quickly. This is a minor bump to Mistral Small 3.1, one of my…

  • The Register: Microsoft 365 brings the shutters down on legacy protocols

    Source URL: https://www.theregister.com/2025/06/19/the_microsoft_axeman_is_coming/ Source: The Register Title: Microsoft 365 brings the shutters down on legacy protocols Feedly Summary: FrontPage Remote Procedure Call and others set to be blocked in the name of ‘Secure by Default’ Microsoft has warned administrators that legacy authentication protocols will be blocked by default from July, meaning that anyone who hasn’t…

  • Slashdot: Reasoning LLMs Deliver Value Today, So AGI Hype Doesn’t Matter

    Source URL: https://slashdot.org/story/25/06/19/165237/reasoning-llms-deliver-value-today-so-agi-hype-doesnt-matter?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Reasoning LLMs Deliver Value Today, So AGI Hype Doesn’t Matter Feedly Summary: AI Summary and Description: Yes Summary: The commentary by Simon Willison highlights a debate surrounding the effectiveness and applicability of large language models (LLMs), particularly in the context of their limitations and the recent critiques by various…

  • Slashdot: Midjourney Launches Its First AI Video Generation Model, V1

    Source URL: https://slashdot.org/story/25/06/18/1935234/midjourney-launches-its-first-ai-video-generation-model-v1 Source: Slashdot Title: Midjourney Launches Its First AI Video Generation Model, V1 Feedly Summary: AI Summary and Description: Yes Summary: Midjourney has launched its AI video generation model, V1, allowing users to create customizable short videos from images. This move positions the startup as a competitor to established entities in the AI…

  • Simon Willison’s Weblog: Coding agents require skilled operators

    Source URL: https://simonwillison.net/2025/Jun/18/coding-agents/#atom-everything Source: Simon Willison’s Weblog Title: Coding agents require skilled operators Feedly Summary: I wrote this recently in a conversation about whether coding agents can work as a replacement for human programmers. The “agentic" coding tools we have right now work like this: A skilled individual with both deep domain understanding and deep…