Tag: large language model

Source URL: https://simonwillison.net/2025/May/25/ai-hallucination-cases/#atom-everything Source: Simon Willison’s Weblog Title: AI Hallucination Cases Feedly Summary: AI Hallucination Cases Damien Charlotin maintains this database of cases around the world where a legal decision has been made that confirms hallucinated content from generative AI was presented by a lawyer. That’s an important distinction: this isn’t just cases where AI…

Slashdot: People Should Know About the ‘Beliefs’ LLMs Form About Them While Conversing

May 24, 2025

—

by

Source URL: https://slashdot.org/story/25/05/24/1946203/people-should-know-about-the-beliefs-llms-form-about-them-while-conversing Source: Slashdot Title: People Should Know About the ‘Beliefs’ LLMs Form About Them While Conversing Feedly Summary: AI Summary and Description: Yes Summary: The text discusses the implications of using large language models (LLMs) like Llama that exhibit human-like biases based on user interactions. This raises critical policy and ethical issues related…

Simon Willison’s Weblog: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft

May 23, 2025

—

by

Source URL: https://simonwillison.net/2025/May/23/remote-prompt-injection-in-gitlab-duo/ Source: Simon Willison’s Weblog Title: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Feedly Summary: Remote Prompt Injection in GitLab Duo Leads to Source Code Theft Yet another example of the classic Markdown image exfiltration attack, this time affecting GitLab Duo – GitLab’s chatbot. Omer Mayraz reports on how…

Simon Willison’s Weblog: Live blog: Claude 4 launch at Code with Claude

May 22, 2025

—

by

Source URL: https://simonwillison.net/2025/May/22/code-with-claude-live-blog/ Source: Simon Willison’s Weblog Title: Live blog: Claude 4 launch at Code with Claude Feedly Summary: I’m at Anthropic’s Code with Claude event, where they are launching Claude 4. I’ll be live blogging the keynote here. Tags: llm-release, liveblogging, anthropic, claude, generative-ai, ai, llms AI Summary and Description: Yes Summary: The text…

Slashdot: Most AI Chatbots Easily Tricked Into Giving Dangerous Responses, Study Finds

—

by

Source URL: https://it.slashdot.org/story/25/05/21/2031216/most-ai-chatbots-easily-tricked-into-giving-dangerous-responses-study-finds?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: Most AI Chatbots Easily Tricked Into Giving Dangerous Responses, Study Finds Feedly Summary: AI Summary and Description: Yes Summary: The text outlines significant security concerns regarding AI-powered chatbots, especially how they can be manipulated to disseminate harmful and illicit information. This research highlights the dangers of “dark LLMs,” which…

Simon Willison’s Weblog: Devstral

—

by

Source URL: https://simonwillison.net/2025/May/21/devstral/#atom-everything Source: Simon Willison’s Weblog Title: Devstral Feedly Summary: Devstral New Apache 2.0 licensed LLM release from Mistral, this time specifically trained for code. Devstral achieves a score of 46.8% on SWE-Bench Verified, outperforming prior open-source SoTA models by more than 6% points. When evaluated under the same test scaffold (OpenHands, provided by…

Simon Willison’s Weblog: Gemini Diffusion

—

by

Source URL: https://simonwillison.net/2025/May/21/gemini-diffusion/ Source: Simon Willison’s Weblog Title: Gemini Diffusion Feedly Summary: Gemini Diffusion Another of the announcements from Google I/O yesterday was Gemini Diffusion, Google’s first LLM to use diffusion (similar to image models like Imagen and Stable Diffusion) in place of transformers. Google describe it like this: Traditional autoregressive language models generate text…

The Register: Microsoft-backed AI out-forecasts hurricane experts without crunching the physics

—

by

Source URL: https://www.theregister.com/2025/05/21/earth_system_model_hurricane_forecast/ Source: The Register Title: Microsoft-backed AI out-forecasts hurricane experts without crunching the physics Feedly Summary: LLM trained on decades of weather data claimed to be faster, and cheaper Scientists have developed a machine learning model that can outperform official agencies at predicting tropical cyclone tracks, and do it faster and cheaper than…

Tomasz Tunguz: My Prompt, My Reality

—

by