Tag: llm-pricing

  • Simon Willison’s Weblog: Medium is the new large

    Source URL: https://simonwillison.net/2025/May/7/medium-is-the-new-large/#atom-everything Source: Simon Willison’s Weblog Title: Medium is the new large Feedly Summary: Medium is the new large New model release from Mistral – this time closed source/proprietary. Mistral Medium claims strong benchmark scores similar to GPT-4o and Claude 3.7 Sonnet, but is priced at $0.40/million input and $2/million output – about the…

  • Simon Willison’s Weblog: llm-prices.com

    Source URL: https://simonwillison.net/2025/May/7/llm-prices/#atom-everything Source: Simon Willison’s Weblog Title: llm-prices.com Feedly Summary: llm-prices.com I’ve been maintaining a simple LLM pricing calculator since October last year. I finally decided to split it out to its own domain name (previously it was hosted at tools.simonwillison.net/llm-prices), running on Cloudflare Pages. The site runs out of my simonw/llm-prices GitHub repository.…

  • Simon Willison’s Weblog: Image segmentation using Gemini 2.5

    Source URL: https://simonwillison.net/2025/Apr/18/gemini-image-segmentation/ Source: Simon Willison’s Weblog Title: Image segmentation using Gemini 2.5 Feedly Summary: Max Woolf pointed out this new feature of the Gemini 2.5 series in a comment on Hacker News: One hidden note from Gemini 2.5 Flash when diving deep into the documentation: for image inputs, not only can the model be…

  • Simon Willison’s Weblog: Start building with Gemini 2.5 Flash

    Source URL: https://simonwillison.net/2025/Apr/17/start-building-with-gemini-25-flash/ Source: Simon Willison’s Weblog Title: Start building with Gemini 2.5 Flash Feedly Summary: Start building with Gemini 2.5 Flash Google Gemini’s latest model is Gemini 2.5 Flash, available in (paid) preview as gemini-2.5-flash-preview-04-17. Building upon the popular foundation of 2.0 Flash, this new version delivers a major upgrade in reasoning capabilities, while…

  • Simon Willison’s Weblog: Introducing OpenAI o3 and o4-mini

    Source URL: https://simonwillison.net/2025/Apr/16/introducing-openai-o3-and-o4-mini/ Source: Simon Willison’s Weblog Title: Introducing OpenAI o3 and o4-mini Feedly Summary: Introducing OpenAI o3 and o4-mini OpenAI are really emphasizing tool use with these: For the first time, our reasoning models can agentically use and combine every tool within ChatGPT—this includes searching the web, analyzing uploaded files and other data with…

  • Simon Willison’s Weblog: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet

    Source URL: https://simonwillison.net/2025/Apr/14/gpt-4-1/ Source: Simon Willison’s Weblog Title: GPT-4.1: Three new million token input models from OpenAI, including their cheapest model yet Feedly Summary: OpenAI introduced three new models this morning: GPT-4.1, GPT-4.1 mini and GPT-4.1 nano. These are API-only models right now, not available through the ChatGPT interface (though you can try them out…

  • Simon Willison’s Weblog: LLM pricing calculator (updated)

    Source URL: https://simonwillison.net/2025/Apr/10/llm-pricing-calculator/#atom-everything Source: Simon Willison’s Weblog Title: LLM pricing calculator (updated) Feedly Summary: LLM pricing calculator (updated) I updated my LLM pricing calculator this morning (Claude transcript) to show the prices of various hosted models in a sorted table, defaulting to lowest price first. Amazon Nova and Google Gemini continue to dominate the lower…

  • Simon Willison’s Weblog: Gemini 2.5 Pro Preview pricing

    Source URL: https://simonwillison.net/2025/Apr/4/gemini-25-pro-pricing/ Source: Simon Willison’s Weblog Title: Gemini 2.5 Pro Preview pricing Feedly Summary: Gemini 2.5 Pro Preview pricing Google’s Gemini 2.5 Pro is currently the top model on LM Arena and, from my own testing, a superb model for OCR, audio transcription and long-context coding. You can now pay for it! The new…

  • Simon Willison’s Weblog: OpenAI platform: o1-pro

    Source URL: https://simonwillison.net/2025/Mar/19/o1-pro/ Source: Simon Willison’s Weblog Title: OpenAI platform: o1-pro Feedly Summary: OpenAI platform: o1-pro OpenAI have a new most-expensive model: o1-pro can now be accessed through their API at a hefty $150/million tokens for input and $600/million tokens for output. That’s 10x the price of their o1 and o1-preview models and a full…

  • Simon Willison’s Weblog: Gemini 2.0 Flash and Flash-Lite

    Source URL: https://simonwillison.net/2025/Feb/25/gemini-20-flash-and-flash-lite/ Source: Simon Willison’s Weblog Title: Gemini 2.0 Flash and Flash-Lite Feedly Summary: Gemini 2.0 Flash and Flash-Lite Gemini 2.0 Flash-Lite is now generally available – previously it was available just as a preview – and has announced pricing. The model is $0.075/million input tokens and $0.030/million output – the same price as…