Cloud Blog: Announcing Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI

May 22, 2025

—

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-opus-4-and-claude-sonnet-4-on-vertex-ai/
Source: Cloud Blog
Title: Announcing Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI

Feedly Summary: Today, we’re expanding the choice of third-party models available in Vertex AI Model Garden with the addition of Anthropic’s newest generation of the Claude model family: Claude Opus 4 and Claude Sonnet 4. Both Claude Opus 4 and Claude Sonnet 4 are hybrid reasoning models, meaning they offer modes for near-instant responses and extended thinking for deeper reasoning.
Claude Opus 4 is Anthropic’s most powerful model to date. Claude Opus 4 excels at coding, with sustained performance on complex, long-running tasks and agent workflows. Use cases include advanced coding work, autonomous AI agents, agentic search and research, tasks that require complex problem solving, and long-running tasks that require precise content management.
Claude Sonnet 4 is Anthropic’s mid-size model that balances performance with cost. It surpasses its predecessor, Claude Sonnet 3.7, across coding and reasoning while responding more precisely to steering. Use cases include coding tasks such as code reviews and bug fixes, AI assistants, efficient research, and large-scale content generation and analysis.
Claude Opus 4 and Claude Sonnet 4 are generally available as a Model-as-a-Service (MaaS) offering on Vertex AI. For more information on the newest Claude models, visit Anthropic’s blog.
Build advanced agents on Vertex AI
Vertex AI is Google Cloud’s comprehensive platform for orchestrating your production AI workflows across three pillars: data, models, and agents—a combination that would otherwise require multiple fragmented solutions. A key component of the model pillar is Vertex AI Model Garden, which offers a curated selection of over 200 foundation models, including Google’s models, third-party models, and open models—empowering you to choose the ideal solution for your specific needs.
You can leverage Vertex AI’s Model-as-a-Service (MaaS) to rapidly deploy and scale Claude-powered intelligent agents and applications, benefiting from integrated agentic tooling, fully managed infrastructure, and enterprise-grade security.
By building on Vertex AI, you can:

Orchestrate sophisticated multi-agent systems: Build agents with an open approach using Google’s Agent Development Kit (ADK) or your preferred framework. Deploy your agents to production with enterprise-grade controls directly in Agent Engine.
Harness the power of Google Cloud integrations: You can connect Claude directly within BigQuery ML to facilitate functions like text generation, summarization, translation, and more.
Optimize performance with provisioned throughput: Reserve dedicated capacity and prioritized processing for critical production workloads with Claude models at a fixed fee. To get started with provisioned throughput, contact your Google Cloud sales representative.
Maximize Claude model utilization: Reduce latency and costs while increasing throughput by employing Vertex AI’s advanced features for Claude models such as batch predictions, prompt caching, token counting, and citations. For detailed information, refer to our documentation.
Scale with fully managed infrastructure: Vertex AI’s fully managed and AI-optimized infrastructure simplifies how you deploy your AI workloads in production. Additionally, Vertex AI’s new global endpoints for Claude (public preview) enhance availability by dynamically serving traffic from the nearest available region.
Build confidently with enterprise-grade security and compliance: Benefit from Vertex AI’s built-in security and compliance measures that satisfy stringent enterprise requirements.

Customers achieving real impact with Claude on Vertex AI
To date, more than 4,000 customers have started using Anthropic’s Claude models on Vertex AI. Here’s a look at how top organizations are driving impactful results with this powerful integration:
Augment Code is running its AI coding assistant, which specializes in helping developers navigate and contribute to production-grade codebases, with Anthropic’s Claude models on Vertex AI.
“What we’re able to get out of Anthropic is truly extraordinary, but all of the work we’ve done to deliver knowledge of customer code, used in conjunction with Anthropic and the other models we host on Google Cloud, is what makes our product so powerful.” – Scott Dietzen, CEO, Augment Code
Palo Alto Networks is accelerating software development and security by deploying Claude on Vertex AI.
“With Claude running on Vertex AI, we saw a 20% to 30% increase in code development velocity. Running Claude on Google Cloud’s Vertex AI not only accelerates development projects, it enables us to hardwire security into code before it ships.” – Gunjan Patel, Director of Engineering, Office of the CPO, Palo Alto Networks
Replit leverages Claude on Vertex AI to power Replit Agent, which empowers people across the world to use natural language prompts to turn their ideas into applications, regardless of coding experience.
“Our AI agent is made more powerful through Anthropic’s Claude models running on Vertex AI. This integration allows us to easily connect with other Google Cloud services, like Cloud Run, to work together behind the scenes to help customers turn their ideas into apps.” – Amjad Masad, Founder and CEO, Replit
Get started
To get started with the new Claude models on Vertex AI, navigate to the Claude Opus 4 or the Claude Sonnet 4 model card in Vertex AI Model Garden, select “Enable”, and follow the proceeding instructions.
You can also find and easily procure Claude Opus 4 and Claude Sonnet 4 on Google Cloud Marketplace.
Explore our sample notebook and documentation to start building.

AI Summary and Description: Yes

**Summary:** The text discusses the expansion of third-party models in Google Cloud’s Vertex AI Model Garden with the introduction of Anthropic’s Claude Opus 4 and Claude Sonnet 4. These new hybrid reasoning models enhance AI capabilities for coding, research, and automation in enterprise environments, emphasizing their integration with robust security and compliance features.

**Detailed Description:**

The integration of Anthropic’s Claude models into Google Cloud’s Vertex AI Model Garden represents a significant advancement in both AI capabilities and security for enterprise users. The introduction of Claude Opus 4 and Claude Sonnet 4 is particularly noteworthy for professionals in AI security, cloud computing, and software development.

**Key Points:**
– **Claude Opus 4:**
– Offers the highest performance among the Claude models.
– Designed for advanced coding tasks, autonomous AI agents, and complex problem-solving.
– Capable of handling long-running tasks with precise management of content.

– **Claude Sonnet 4:**
– A mid-sized model that delivers a balance between performance and cost, surpassing its predecessor Claude Sonnet 3.7.
– Suitable for tasks such as code reviews, bug fixes, AI assistants, and efficient research.

– **Deployment on Vertex AI:**
– Both models are available as Model-as-a-Service (MaaS), facilitating easy integration and deployment in existing AI workflows.
– Vertex AI combines data, models, and agents, providing a comprehensive platform that streamlines production AI processes.

– **Building Advanced Agents:**
– Users can develop sophisticated multi-agent systems using Google’s Agent Development Kit (ADK) or other preferred frameworks.
– The platform supports enterprises with tools for agent deployment and management, ensuring robust security controls.

– **Integration with Google Cloud Services:**
– Claude models can connect with services like BigQuery ML, enabling text generation and data processing capabilities.
– Users can reserve dedicated capacity for critical workloads, optimizing performance and managing costs effectively.

– **Security and Compliance:**
– The Vertex AI platform emphasizes enterprise-grade security measures that meet stringent compliance requirements, offering confidence for businesses integrating these models into their operations.

– **Case Studies:**
– Companies like Augment Code and Palo Alto Networks report substantial improvements in development velocity and security effectiveness through the deployment of Claude models, demonstrating their practical impact in real-world scenarios.

– **Getting Started:**
– Users are guided to access the new Claude models in Vertex AI Model Garden, with support materials available to assist in building and deploying AI applications.

This text serves as an important note for security and compliance professionals, highlighting advancements in AI model capabilities while ensuring that enterprise-grade security measures are placed at the forefront of AI developments.

2 3 4 7 a aaS access Act ads advanced coding advancement advancements agent agent deployment agent development Agent Development Kit Agent Engine agent system agent systems agent workflows agents AGI AI AI applications AI assistants AI development ai model AI security AI workloads alt analysis and Anthropic Anthropic’s Claude Anthropics anti API app Application applications Arch art as assistant assistants Augment Auto automation autonomous availability Bi BigQuery Bug bug fixes building built business by C caching capabilities capacity CI CIA citations Claude Claude model Claude Sonnet Cloud cloud computing cloud integration Cloud Run cloud service cloud services co code code review code reviews codebase Codebases coding coding assistant coding tasks companies complex problem compliance compliance measures compliance professionals compliance requirements Computing content Content Generation content management control controls cost Costs CoT critical cross Customer D data data processing day de deep demo deployment design developer developers development development velocity developments document documentation Driving e e-learning edge effective effectiveness efficient election end endpoint endpoints Engineer engineering enterprise enterprise environments enterprise use enterprise users enterprise-grade security enterprises environment ERP Excel exp Expansion experience eXtended feature features fixes for foundation model foundation models framework frameworks front full function g Gen general generation Go Google Google Cloud Google Cloud Marketplace Google Cloud services grade grade security H high Highlight HP HR http HTTPS hybrid hybrid reasoning Hybrid Reasoning Model in information infrastructure integration integrations Intel intelligent agents io iOS Iron J k Key knowledge l language large latency learning led Li long low M MaaS mac machine made man management market marketplace Materials max measures mid ML Mode model model capabilities model card model family model utilization Model-as-a-Service models multi multi-agent systems N nation natural language natural language prompts needs network networks no notebook o of off on one only open open models operation operations OPM opt orchestrating organization organizations oS out over Palo Alto Palo Alto Networks party performance phi platform platform support point Power pre Preview problem problem-solving process processes processing product production products professionals project projects prompt prompt caching prompts provisioned throughput public Q R rag rate RCE real Real-World Scenarios reasoning reasoning mode reasoning model reasoning models red Region Replit report Requirements research response responses Ro robust security s sales sam Scale search sec security security and compliance security controls security measure security measures service services Sig Sim size software software development solutions solving source specific SSE SSO start summarization support system systems T Task tasks text text generation the thinking third third-party throughput to token token count token counting tool tooling tools Tor TP traffic translation turn two UI under up US use use cases user Users utilization V Vertex Vertex AI Vision Ware Wi workflow workflows workload workloads world world scenarios x Zen