Simon Willison’s Weblog: Medium is the new large

Source URL: https://simonwillison.net/2025/May/7/medium-is-the-new-large/#atom-everything
Source: Simon Willison’s Weblog
Title: Medium is the new large

Feedly Summary: Medium is the new large
New model release from Mistral – this time closed source/proprietary. Mistral Medium claims strong benchmark scores similar to GPT-4o and Claude 3.7 Sonnet, but is priced at $0.40/million input and $2/million output – about the same price as GPT 4.1 Mini. For comparison, GPT-4o is $2.50/$10 and Claude 3.7 Sonnet is $3/$15.
More interesting than the price is the deployment model. Mistral Medium may not be open weights but it is very much available for self-hosting:

Mistral Medium 3 can also be deployed on any cloud, including self-hosted environments of four GPUs and above.

Mistral’s other announcement today is Le Chat Enterprise. This is a suite of tools that can integrate with your company’s internal data and provide “agents" (these look similar to Claude Projects or OpenAI GPTs), again with the option to self-host.
Is there a new open weights model coming soon? This note tucked away at the bottom of the Mistral Medium 3 announcement seems to hint at that:

With the launches of Mistral Small in March and Mistral Medium today, it’s no secret that we’re working on something ‘large’ over the next few weeks. With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we’re excited to ‘open’ up what’s to come 🙂

Tags: llm-release, mistral, generative-ai, ai, llms, llm-pricing

AI Summary and Description: Yes

Summary: The text discusses the new release of Mistral Medium, a proprietary AI model that boasts benchmark scores comparable to leading models like GPT-4o and Claude 3.7 Sonnet. It highlights Mistral Medium’s pricing structure and deployment flexibility, including self-hosting options, making it relevant for professionals dealing with AI security, cloud computing, and infrastructure.

Detailed Description: The recent announcement of Mistral Medium introduces a proprietary model in the AI landscape with notable implications for security and deployment in various environments.

– **Model Overview**:
– Mistral Medium claims strong performance benchmarks similar to GPT-4o and Claude 3.7 Sonnet.
– Priced competitively at $0.40/million input and $2/million output, mirroring the pricing strategy of GPT 4.1 Mini.

– **Deployment Options**:
– The model is available for self-hosting, requiring only a minimum setup of four GPUs.
– This flexibility allows organizations to deploy the AI model in their own cloud environments, facilitating greater control over data privacy and security.

– **Additional Offerings**:
– Mistral’s Le Chat Enterprise suite provides tools for integration with internal company data, creating ‘agents’ for specialized tasks.
– Offers self-hosting capabilities to enhance data security and compliance.

– **Future Developments**:
– Hints at forthcoming model releases, implying potential advancements in open-source models, which could further influence the landscape for enterprise AI applications.

The relevance of this information lies in its implications for security and compliance professionals, particularly concerning data governance, deployment strategies, and ensuring the integrity of proprietary AI models in organizational settings.