Simon Willison’s Weblog: Medium is the new large

May 7, 2025

—

Source URL: https://simonwillison.net/2025/May/7/medium-is-the-new-large/#atom-everything
Source: Simon Willison’s Weblog
Title: Medium is the new large

Feedly Summary:
New model release from Mistral – this time closed source/proprietary. Mistral Medium claims strong benchmark scores similar to GPT-4o and Claude 3.7 Sonnet, but is priced at $0.40/million input and $2/million output – about the same price as GPT 4.1 Mini. For comparison, GPT-4o is $2.50/$10 and Claude 3.7 Sonnet is $3/$15.
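To make the price gap concrete, here is a minimal sketch that applies the per-million-token prices quoted above to a single workload. The workload size (10 million input tokens and 2 million output tokens) and the names `PRICES` and `workload_cost` are assumptions chosen for illustration, not figures or code from Mistral's announcement.

```python
# Prices in USD per million tokens (input, output), as quoted in the text above.
PRICES = {
    "Mistral Medium 3": (0.40, 2.00),
    "GPT-4o": (2.50, 10.00),
    "Claude 3.7 Sonnet": (3.00, 15.00),
}


def workload_cost(model, input_tokens, output_tokens):
    """Return the USD cost of a workload at the quoted per-million-token prices."""
    input_price, output_price = PRICES[model]
    input_cost = (input_tokens / 1_000_000) * input_price
    output_cost = (output_tokens / 1_000_000) * output_price
    return input_cost + output_cost


# Hypothetical workload, assumed purely for illustration.
INPUT_TOKENS = 10_000_000
OUTPUT_TOKENS = 2_000_000

for model in PRICES:
    cost = workload_cost(model, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{model}: ${cost:.2f}")
```

For this assumed workload the totals come to $8 for Mistral Medium 3, $45 for GPT-4o and $60 for Claude 3.7 Sonnet. The ratio shifts with the input/output mix, so it is worth rerunning with token counts that match your own usage.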
More interesting than the price is the deployment model. Mistral Medium may not be open weights but it is very much available for self-hosting:

Mistral Medium 3 can also be deployed on any cloud, including self-hosted environments of four GPUs and above.

Mistral’s other announcement today is Le Chat Enterprise. This is a suite of tools that can integrate with your company’s internal data and provide “agents” (these look similar to Claude Projects or OpenAI GPTs), again with the option to self-host.
Is there a new open weights model coming soon? This note tucked away at the bottom of the Mistral Medium 3 announcement seems to hint at that:

With the launches of Mistral Small in March and Mistral Medium today, it’s no secret that we’re working on something ‘large’ over the next few weeks. With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we’re excited to ‘open’ up what’s to come 🙂

Tags: llm-release, mistral, generative-ai, ai, llms, llm-pricing

AI Summary and Description: Yes

Summary: The text discusses the new release of Mistral Medium, a proprietary AI model that boasts benchmark scores comparable to leading models like GPT-4o and Claude 3.7 Sonnet. It highlights Mistral Medium’s pricing structure and deployment flexibility, including self-hosting options, making it relevant for professionals dealing with AI security, cloud computing, and infrastructure.

Detailed Description: Mistral Medium 3 is a proprietary, closed-weights model that can nonetheless be self-hosted, which gives it direct implications for security and deployment planning.

– **Model Overview**:
– Mistral Medium claims strong performance benchmarks similar to GPT-4o and Claude 3.7 Sonnet.
– Priced at $0.40/million input tokens and $2/million output tokens, about the same as GPT 4.1 Mini and well below GPT-4o ($2.50/$10) and Claude 3.7 Sonnet ($3/$15).

– **Deployment Options**:
– The model is not open weights, but it can be deployed on any cloud, including self-hosted environments of four GPUs and above.
– This flexibility allows organizations to deploy the AI model in their own cloud environments, facilitating greater control over data privacy and security.

– **Additional Offerings**:
– Mistral’s Le Chat Enterprise suite provides tools for integration with internal company data, creating ‘agents’ for specialized tasks.
– Offers self-hosting capabilities to enhance data security and compliance.

– **Future Developments**:
– A note at the bottom of the Mistral Medium 3 announcement says Mistral is working on something ‘large’ over the next few weeks and hints that it may be released as open weights, which could further influence enterprise AI deployments.

The relevance of this information lies in its implications for security and compliance professionals, particularly concerning data governance, deployment strategies, and ensuring the integrity of proprietary AI models in organizational settings.
