Tag: Mistral
-
Simon Willison’s Weblog: GLM-4.5: Reasoning, Coding, and Agentic Abililties
Source URL: https://simonwillison.net/2025/Jul/28/glm-45/#atom-everything Source: Simon Willison’s Weblog Title: GLM-4.5: Reasoning, Coding, and Agentic Abililties Feedly Summary: GLM-4.5: Reasoning, Coding, and Agentic Abililties Another day, another significant new open weight model release from a Chinese frontier AI lab. This time it’s Z.ai – who rebranded (at least in English) from Zhipu AI a few months ago.…
-
Simon Willison’s Weblog: Our contribution to a global environmental standard for AI
Source URL: https://simonwillison.net/2025/Jul/22/mistral-environmental-standard/ Source: Simon Willison’s Weblog Title: Our contribution to a global environmental standard for AI Feedly Summary: Our contribution to a global environmental standard for AI Mistral have released environmental impact numbers for their largest model, Mistral Large 2, in more detail than I have seen from any of the other large AI…
-
Simon Willison’s Weblog: Voxtral
Source URL: https://simonwillison.net/2025/Jul/16/voxtral/#atom-everything Source: Simon Willison’s Weblog Title: Voxtral Feedly Summary: Voxtral Mistral released their first audio-input models yesterday: Voxtral Small and Voxtral Mini. These state‑of‑the‑art speech understanding models are available in two sizes—a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache…
-
Cloud Blog: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough
Source URL: https://cloud.google.com/blog/topics/developers-practitioners/implementing-high-performance-llm-serving-on-gke-an-inference-gateway-walkthrough/ Source: Cloud Blog Title: Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough Feedly Summary: The excitement around open Large Language Models like Gemma, Llama, Mistral, and Qwen is evident, but developers quickly hit a wall. How do you deploy them effectively at scale? Traditional load balancing algorithms fall short, as…
-
The Register: EU businesses want a pause on AI regulations so they can cope with unregulated Big Tech players
Source URL: https://www.theregister.com/2025/07/04/eu_businesses_push_for_freedom/ Source: The Register Title: EU businesses want a pause on AI regulations so they can cope with unregulated Big Tech players Feedly Summary: Mistral fears continental companies may not get time to escape ‘distant, behemoth corporations’ French AI business Mistral on Thursday announced an initiative called “AI for Citizens,” which it says…
-
Simon Willison’s Weblog: model.yaml
Source URL: https://simonwillison.net/2025/Jun/21/model-yaml/#atom-everything Source: Simon Willison’s Weblog Title: model.yaml Feedly Summary: model.yaml From their GitHub repo it looks like this effort quietly launched a couple of months ago, driven by the LM Studio team. Their goal is to specify an “open standard for defining crossplatform, composable AI models”. A model can be defined using a…
-
Simon Willison’s Weblog: Mistral-Small 3.2
Source URL: https://simonwillison.net/2025/Jun/20/mistral-small-32/ Source: Simon Willison’s Weblog Title: Mistral-Small 3.2 Feedly Summary: Mistral-Small 3.2 Released on Hugging Face a couple of hours ago, so far there aren’t any quantizations to run it on a Mac but I’m sure those will emerge pretty quickly. This is a minor bump to Mistral Small 3.1, one of my…
-
Simon Willison’s Weblog: The lethal trifecta for AI agents: private data, untrusted content, and external communication
Source URL: https://simonwillison.net/2025/Jun/16/the-lethal-trifecta/#atom-everything Source: Simon Willison’s Weblog Title: The lethal trifecta for AI agents: private data, untrusted content, and external communication Feedly Summary: If you are a user of LLM systems that use tools (you can call them “AI agents” if you like) it is critically important that you understand the risk of combining tools…
-
Simon Willison’s Weblog: Magistral — the first reasoning model by Mistral AI
Source URL: https://simonwillison.net/2025/Jun/10/magistral/ Source: Simon Willison’s Weblog Title: Magistral — the first reasoning model by Mistral AI Feedly Summary: Magistral — the first reasoning model by Mistral AI Mistral’s first reasoning model is out today, in two sizes. There’s a 24B Apache 2 licensed open-weights model called Magistral Small (actually Magistral-Small-2506), and a larger API-only…
-
Simon Willison’s Weblog: The last six months in LLMs, illustrated by pelicans on bicycles
Source URL: https://simonwillison.net/2025/Jun/6/six-months-in-llms/#atom-everything Source: Simon Willison’s Weblog Title: The last six months in LLMs, illustrated by pelicans on bicycles Feedly Summary: I presented an invited keynote at the AI Engineer World’s Fair in San Francisco this week. This is my third time speaking at the event – here’s my talks from October 2023 and…