Tag: benchmark analysis
-
Simon Willison’s Weblog: Open weight LLMs exhibit inconsistent performance across providers
Source URL: https://simonwillison.net/2025/Aug/15/inconsistent-performance/ Source: Simon Willison’s Weblog Title: Open weight LLMs exhibit inconsistent performance across providers Feedly Summary: Artificial Analysis published a new benchmark the other day, this time focusing on how an individual model – OpenAI’s gpt-oss-120b – performs across different hosted providers. The results showed some surprising differences. Here’s the one with the…