Opus – Experimental News Clipping Site

Simon Willison’s Weblog: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now)

Sep 29, 2025

—

by

Source URL: https://simonwillison.net/2025/Sep/29/claude-sonnet-4-5/ Source: Simon Willison’s Weblog Title: Claude Sonnet 4.5 is probably the "best coding model in the world" (at least for now) Feedly Summary: Anthropic released Claude Sonnet 4.5 today, with a very bold set of claims: Claude Sonnet 4.5 is the best coding model in the world. It’s the strongest model for…

Slashdot: New Claude Model Runs 30-Hour Marathon To Create 11,000-Line Slack Clone

Sep 29, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://developers.slashdot.org/story/25/09/29/1733238/new-claude-model-runs-30-hour-marathon-to-create-11000-line-slack-clone?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: New Claude Model Runs 30-Hour Marathon To Create 11,000-Line Slack Clone Feedly Summary: AI Summary and Description: Yes Summary: Anthropic’s release of Claude Sonnet 4.5 marks a significant advancement in autonomous AI capabilities, particularly in code generation and application development. This model can substantially improve productivity for developers by…

Slashdot: OpenAI Says GPT-5 Stacks Up To Humans in a Wide Range of Jobs

Sep 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://slashdot.org/story/25/09/25/176219/openai-says-gpt-5-stacks-up-to-humans-in-a-wide-range-of-jobs?utm_source=rss1.0mainlinkanon&utm_medium=feed Source: Slashdot Title: OpenAI Says GPT-5 Stacks Up To Humans in a Wide Range of Jobs Feedly Summary: AI Summary and Description: Yes Summary: OpenAI has introduced GDPval, a new benchmark to assess the performance of its AI models against that of human professionals across various industries. The benchmark indicates that models…

Simon Willison’s Weblog: CompileBench: Can AI Compile 22-year-old Code?

Sep 22, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/22/compilebench/ Source: Simon Willison’s Weblog Title: CompileBench: Can AI Compile 22-year-old Code? Feedly Summary: CompileBench: Can AI Compile 22-year-old Code? Interesting new LLM benchmark from Piotr Grabowski and Piotr Migdał: how well can different models handle compilation challenges such as cross-compiling gucr for ARM64 architecture? This is one of my favorite applications of…

Simon Willison’s Weblog: My review of Claude’s new Code Interpreter, released under a very confusing name

Sep 9, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/9/claude-code-interpreter/ Source: Simon Willison’s Weblog Title: My review of Claude’s new Code Interpreter, released under a very confusing name Feedly Summary: Today on the Anthropic blog: Claude can now create and edit files: Claude can now create and edit Excel spreadsheets, documents, PowerPoint slide decks, and PDFs directly in Claude.ai and the desktop…

Simon Willison’s Weblog: Anthropic status: Model output quality

Sep 9, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Sep/9/anthropic-model-output-quality/ Source: Simon Willison’s Weblog Title: Anthropic status: Model output quality Feedly Summary: Anthropic status: Model output quality Anthropic previously reported model serving bugs that affected Claude Opus 4 and 4.1 for 56.5 hours. They’ve now fixed additional bugs affecting “a small percentage" of Sonnet 4 requests for almost a month, plus a…

Simon Willison’s Weblog: Claude Opus 4.1 and Opus 4 degraded quality

Aug 30, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/30/claude-degraded-quality/#atom-everything Source: Simon Willison’s Weblog Title: Claude Opus 4.1 and Opus 4 degraded quality Feedly Summary: Claude Opus 4.1 and Opus 4 degraded quality Notable because often when people complain of degraded model quality it turns out to be unfounded – Anthropic in the past have emphasized that they don’t change the model…

Simon Willison’s Weblog: GPT-5: Key characteristics, pricing and model card

Aug 7, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/7/gpt-5/#atom-everything Source: Simon Willison’s Weblog Title: GPT-5: Key characteristics, pricing and model card Feedly Summary: I’ve had preview access to the new GPT-5 model family for the past two weeks, and have been using GPT-5 as my daily-driver. It’s my new favorite model. It’s still an LLM – it’s not a dramatic departure…

Simon Willison’s Weblog: Claude Opus 4.1

Aug 5, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://simonwillison.net/2025/Aug/5/claude-opus-41/ Source: Simon Willison’s Weblog Title: Claude Opus 4.1 Feedly Summary: Claude Opus 4.1 Surprise new model from Anthropic today – Claude Opus 4.1, which they describe as “a drop-in replacement for Opus 4". My favorite thing about this model is the version number – treating this as a .1 version increment looks…

Slashdot: Claude Code Users Hit With Weekly Rate Limits

Jul 29, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://developers.slashdot.org/story/25/07/29/0156200/claude-code-users-hit-with-weekly-rate-limits Source: Slashdot Title: Claude Code Users Hit With Weekly Rate Limits Feedly Summary: AI Summary and Description: Yes Summary: Anthropic’s decision to implement weekly rate limits for Claude subscribers is a significant shift aimed at managing resource consumption and preventing account sharing. This move highlights the growing challenges in scaling AI services…

Tag: Opus