Tag: max

  • Simon Willison’s Weblog: XBai o4

    Source URL: https://simonwillison.net/2025/Aug/3/xbai-o4/#atom-everything Source: Simon Willison’s Weblog Title: XBai o4 Feedly Summary: XBai o4 Yet another open source (Apache 2.0) LLM from a Chinese AI lab. This model card claims: XBai o4 excels in complex reasoning capabilities and has now completely surpassed OpenAI-o3-mini in Medium mode. This a 32.8 billion parameter model released by MetaStone…

  • Simon Willison’s Weblog: Faster inference

    Source URL: https://simonwillison.net/2025/Aug/1/faster-inference/ Source: Simon Willison’s Weblog Title: Faster inference Feedly Summary: Two interesting examples of inference speed as a flagship feature of LLM services today. First, Cerebras announced two new monthly plans for their extremely high speed hosted model service: Cerebras Code Pro ($50/month, 1,000 messages a day) and Cerebras Code Max ($200/month, 5,000/day).…

  • Simon Willison’s Weblog: Reverse engineering some updates to Claude

    Source URL: https://simonwillison.net/2025/Jul/31/updates-to-claude/#atom-everything Source: Simon Willison’s Weblog Title: Reverse engineering some updates to Claude Feedly Summary: Anthropic released two major new features for their consumer-facing Claude apps in the past couple of days. Sadly, they don’t do a very good job of updating the release notes for those apps – neither of these releases came…

  • Cloud Blog: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6

    Source URL: https://cloud.google.com/blog/products/compute/c4-vms-based-on-intel-6th-gen-xeon-granite-rapids-now-ga/ Source: Cloud Blog Title: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6 Feedly Summary: We’re thrilled to announce a significant expansion of our C4 virtual machine series, with the general availability of 28 powerful new shapes. This expansion introduces C4 shapes with Google’s next-gen…

  • Cloud Blog: Announcing a complete developer toolkit for scaling A2A agents on Google Cloud

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/agent2agent-protocol-is-getting-an-upgrade/ Source: Cloud Blog Title: Announcing a complete developer toolkit for scaling A2A agents on Google Cloud Feedly Summary: AI is  evolving beyond single, task-specific agents into an interconnected ecosystem, where autonomous agents collaborate  to solve complex problems, regardless of their underlying platform. To make this transition easier for developers, we are announcing…

  • Simon Willison’s Weblog: My 2.5 year old laptop can write Space Invaders in JavaScript now

    Source URL: https://simonwillison.net/2025/Jul/29/space-invaders/ Source: Simon Willison’s Weblog Title: My 2.5 year old laptop can write Space Invaders in JavaScript now Feedly Summary: I wrote about the new GLM-4.5 model family yesterday – new open weight (MIT licensed) models from Z.ai in China which their benchmarks claim score highly in coding even against models such as…

  • Simon Willison’s Weblog: Quoting Anthropic

    Source URL: https://simonwillison.net/2025/Jul/28/anthropic/ Source: Simon Willison’s Weblog Title: Quoting Anthropic Feedly Summary: We’re rolling out new weekly rate limits for Claude Pro and Max in late August. We estimate they’ll apply to less than 5% of subscribers based on current usage. […] Some of the biggest Claude Code fans are running it continuously in the…

  • Cloud Blog: The global endpoint offers improved availability for Anthropic’s Claude on Vertex AI

    Source URL: https://cloud.google.com/blog/products/ai-machine-learning/global-endpoint-for-claude-models-generally-available-on-vertex-ai/ Source: Cloud Blog Title: The global endpoint offers improved availability for Anthropic’s Claude on Vertex AI Feedly Summary: Anthropic’s Claude models on Vertex AI now have improved overall availability with the global endpoint for Claude models. Now generally available, the global endpoint unlocks the ability to dynamically route your requests to any…

  • Cloud Blog: Too many threats, too much data, say security and IT leaders. Here’s how to fix that

    Source URL: https://cloud.google.com/blog/products/identity-security/too-many-threats-too-much-data-new-survey-heres-how-to-fix-that/ Source: Cloud Blog Title: Too many threats, too much data, say security and IT leaders. Here’s how to fix that Feedly Summary: An overwhelming volume of threats and data combined with the shortage of skilled threat analysts has left many security and IT leaders believing that their organizations are vulnerable to cyberattacks…