Claude Opus 4 – Experimental News Clipping Site

Slashdot: OpenAI Says GPT-5 Stacks Up To Humans in a Wide Range of Jobs

Sep 25, 2025

—

by

Source URL: https://slashdot.org/story/25/09/25/176219/openai-says-gpt-5-stacks-up-to-humans-in-a-wide-range-of-jobs?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI Says GPT-5 Stacks Up To Humans in a Wide Range of Jobs Feedly Summary: AI Summary and Description: Yes Summary: OpenAI has introduced GDPval, a new benchmark to assess the performance of its AI models against that of human professionals across various industries. The benchmark indicates that models…

Simon Willison’s Weblog: CompileBench: Can AI Compile 22-year-old Code?

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/22/compilebench/ Source: Simon Willison’s Weblog Title: CompileBench: Can AI Compile 22-year-old Code? Feedly Summary: CompileBench: Can AI Compile 22-year-old Code? Interesting new LLM benchmark from Piotr Grabowski and Piotr Migdał: how well can different models handle compilation challenges such as cross-compiling gucr for ARM64 architecture? This is one of my favorite applications of…

Simon Willison’s Weblog: My review of Claude’s new Code Interpreter, released under a very confusing name

Sep 9, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/9/claude-code-interpreter/ Source: Simon Willison’s Weblog Title: My review of Claude’s new Code Interpreter, released under a very confusing name Feedly Summary: Today on the Anthropic blog: Claude can now create and edit files: Claude can now create and edit Excel spreadsheets, documents, PowerPoint slide decks, and PDFs directly in Claude.ai and the desktop…

Simon Willison’s Weblog: Anthropic status: Model output quality

Sep 9, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/9/anthropic-model-output-quality/ Source: Simon Willison’s Weblog Title: Anthropic status: Model output quality Feedly Summary: Anthropic status: Model output quality Anthropic previously reported model serving bugs that affected Claude Opus 4 and 4.1 for 56.5 hours. They’ve now fixed additional bugs affecting “a small percentage" of Sonnet 4 requests for almost a month, plus a…

Simon Willison’s Weblog: Claude Opus 4.1 and Opus 4 degraded quality

Aug 30, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/30/claude-degraded-quality/#atom-everything Source: Simon Willison’s Weblog Title: Claude Opus 4.1 and Opus 4 degraded quality Feedly Summary: Claude Opus 4.1 and Opus 4 degraded quality Notable because often when people complain of degraded model quality it turns out to be unfounded – Anthropic in the past have emphasized that they don’t change the model…

Simon Willison’s Weblog: GPT-5: Key characteristics, pricing and model card

Aug 7, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/7/gpt-5/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5: Key characteristics, pricing and model card Feedly Summary: I’ve had preview access to the new GPT-5 model family for the past two weeks, and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM – it’s not a dramatic departure…

Simon Willison’s Weblog: Claude Opus 4.1

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/5/claude-opus-41/ Source: Simon Willison’s Weblog Title: Claude Opus 4.1 Feedly Summary: Claude Opus 4.1 Surprise new model from Anthropic today – Claude Opus 4.1, which they describe as “a drop-in replacement for Opus 4". My favorite thing about this model is the version number – treating this as a .1 version increment looks…

Simon Willison’s Weblog: GLM-4.5: Reasoning, Coding, and Agentic Abililties

Jul 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/28/glm-45/#atom-everything Source: Simon Willison’s Weblog Title: GLM-4.5: Reasoning, Coding, and Agentic Abililties Feedly Summary: GLM-4.5: Reasoning, Coding, and Agentic Abililties Another day, another significant new open weight model release from a Chinese frontier AI lab. This time it’s Z.ai – who rebranded (at least in English) from Zhipu AI a few months ago.…

Cloud Blog: The global endpoint offers improved availability for Anthropic’s Claude on Vertex AI

Jul 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/global-endpoint-for-claude-models-generally-available-on-vertex-ai/ Source: Cloud Blog Title: The global endpoint offers improved availability for Anthropic’s Claude on Vertex AI Feedly Summary: Anthropic’s Claude models on Vertex AI now have improved overall availability with the global endpoint for Claude models. Now generally available, the global endpoint unlocks the ability to dynamically route your requests to any…

Simon Willison’s Weblog: Qwen3-235B-A22B-Thinking-2507

Jul 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Jul/25/qwen3-235b-a22b-thinking-2507/#atom-everything Source: Simon Willison’s Weblog Title: Qwen3-235B-A22B-Thinking-2507 Feedly Summary: Qwen3-235B-A22B-Thinking-2507 The third Qwen model release week, following Qwen3-235B-A22B-Instruct-2507 on Monday 21st and Qwen3-Coder-480B-A35B-Instruct on Tuesday 22nd. Those two were both non-reasoning models – a change from the previous models in the Qwen 3 family which combined reasoning and non-reasoning in the same model,…

Tag: Claude Opus 4