Cloud Blog: Announcing Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/anthropics-claude-opus-4-and-claude-sonnet-4-on-vertex-ai/
Source: Cloud Blog
Title: Announcing Anthropic’s Claude Opus 4 and Claude Sonnet 4 on Vertex AI

Feedly Summary: Today, we’re expanding the choice of third-party models available in Vertex AI Model Garden with the addition of Anthropic’s newest generation of the Claude model family: Claude Opus 4 and Claude Sonnet 4. Both Claude Opus 4 and Claude Sonnet 4 are hybrid reasoning models, meaning they offer modes for near-instant responses and extended thinking for deeper reasoning.  
Claude Opus 4 is Anthropic’s most powerful model to date. Claude Opus 4 excels at coding, with sustained performance on complex, long-running tasks and agent workflows. Use cases include advanced coding work, autonomous AI agents, agentic search and research, tasks that require complex problem solving, and long-running tasks that require precise content management. 
Claude Sonnet 4 is Anthropic’s mid-size model that balances performance with cost. It surpasses its predecessor, Claude Sonnet 3.7, across coding and reasoning while responding more precisely to steering. Use cases include coding tasks such as code reviews and bug fixes, AI assistants, efficient research, and large-scale content generation and analysis.
Claude Opus 4 and Claude Sonnet 4 are generally available as a Model-as-a-Service (MaaS) offering on Vertex AI. For more information on the newest Claude models, visit Anthropic’s blog.
Build advanced agents on Vertex AI
Vertex AI is Google Cloud’s comprehensive platform for orchestrating your production AI workflows across three pillars: data, models, and agents—a combination that would otherwise require multiple fragmented solutions. A key component of the model pillar is Vertex AI Model Garden, which offers a curated selection of over 200 foundation models, including Google’s models, third-party models, and open models—empowering you to choose the ideal solution for your specific needs.
You can leverage Vertex AI’s Model-as-a-Service (MaaS) to rapidly deploy and scale Claude-powered intelligent agents and applications, benefiting from integrated agentic tooling, fully managed infrastructure, and enterprise-grade security.
By building on Vertex AI, you can: 

Orchestrate sophisticated multi-agent systems: Build agents with an open approach using Google’s Agent Development Kit (ADK) or your preferred framework. Deploy your agents to production with enterprise-grade controls directly in Agent Engine. 
Harness the power of Google Cloud integrations: You can connect Claude directly within BigQuery ML to facilitate functions like text generation, summarization, translation, and more.
Optimize performance with provisioned throughput: Reserve dedicated capacity and prioritized processing for critical production workloads with Claude models at a fixed fee. To get started with provisioned throughput, contact your Google Cloud sales representative.
Maximize Claude model utilization: Reduce latency and costs while increasing throughput by employing Vertex AI’s advanced features for Claude models such as batch predictions, prompt caching, token counting, and citations. For detailed information, refer to our documentation.
Scale with fully managed infrastructure: Vertex AI’s fully managed and AI-optimized infrastructure simplifies how you deploy your AI workloads in production. Additionally, Vertex AI’s new global endpoints for Claude (public preview) enhance availability by dynamically serving traffic from the nearest available region.
Build confidently with enterprise-grade security and compliance: Benefit from Vertex AI’s built-in security and compliance measures that satisfy stringent enterprise requirements.

Customers achieving real impact with Claude on Vertex AI
To date, more than 4,000 customers have started using Anthropic’s Claude models on Vertex AI. Here’s a look at how top organizations are driving impactful results with this powerful integration:
Augment Code is running its AI coding assistant, which specializes in helping developers navigate and contribute to production-grade codebases, with Anthropic’s Claude models on Vertex AI.
“What we’re able to get out of Anthropic is truly extraordinary, but all of the work we’ve done to deliver knowledge of customer code, used in conjunction with Anthropic and the other models we host on Google Cloud, is what makes our product so powerful.” – Scott Dietzen, CEO, Augment Code 
Palo Alto Networks is accelerating software development and security by deploying Claude on Vertex AI.
“With Claude running on Vertex AI, we saw a 20% to 30% increase in code development velocity. Running Claude on Google Cloud’s Vertex AI not only accelerates development projects, it enables us to hardwire security into code before it ships.” – Gunjan Patel, Director of Engineering, Office of the CPO, Palo Alto Networks
Replit leverages Claude on Vertex AI to power Replit Agent, which empowers people across the world to use natural language prompts to turn their ideas into applications, regardless of coding experience.
“Our AI agent is made more powerful through Anthropic’s Claude models running on Vertex AI. This integration allows us to easily connect with other Google Cloud services, like Cloud Run, to work together behind the scenes to help customers turn their ideas into apps.” – Amjad Masad, Founder and CEO, Replit
Get started 
To get started with the new Claude models on Vertex AI, navigate to the Claude Opus 4 or the Claude Sonnet 4 model card in Vertex AI Model Garden, select “Enable”, and follow the proceeding instructions. 
You can also find and easily procure Claude Opus 4 and Claude Sonnet 4 on Google Cloud Marketplace.
Explore our sample notebook and documentation to start building.

AI Summary and Description: Yes

**Summary:** The text discusses the expansion of third-party models in Google Cloud’s Vertex AI Model Garden with the introduction of Anthropic’s Claude Opus 4 and Claude Sonnet 4. These new hybrid reasoning models enhance AI capabilities for coding, research, and automation in enterprise environments, emphasizing their integration with robust security and compliance features.

**Detailed Description:**

The integration of Anthropic’s Claude models into Google Cloud’s Vertex AI Model Garden represents a significant advancement in both AI capabilities and security for enterprise users. The introduction of Claude Opus 4 and Claude Sonnet 4 is particularly noteworthy for professionals in AI security, cloud computing, and software development.

**Key Points:**
– **Claude Opus 4:**
– Offers the highest performance among the Claude models.
– Designed for advanced coding tasks, autonomous AI agents, and complex problem-solving.
– Capable of handling long-running tasks with precise management of content.

– **Claude Sonnet 4:**
– A mid-sized model that delivers a balance between performance and cost, surpassing its predecessor Claude Sonnet 3.7.
– Suitable for tasks such as code reviews, bug fixes, AI assistants, and efficient research.

– **Deployment on Vertex AI:**
– Both models are available as Model-as-a-Service (MaaS), facilitating easy integration and deployment in existing AI workflows.
– Vertex AI combines data, models, and agents, providing a comprehensive platform that streamlines production AI processes.

– **Building Advanced Agents:**
– Users can develop sophisticated multi-agent systems using Google’s Agent Development Kit (ADK) or other preferred frameworks.
– The platform supports enterprises with tools for agent deployment and management, ensuring robust security controls.

– **Integration with Google Cloud Services:**
– Claude models can connect with services like BigQuery ML, enabling text generation and data processing capabilities.
– Users can reserve dedicated capacity for critical workloads, optimizing performance and managing costs effectively.

– **Security and Compliance:**
– The Vertex AI platform emphasizes enterprise-grade security measures that meet stringent compliance requirements, offering confidence for businesses integrating these models into their operations.

– **Case Studies:**
– Companies like Augment Code and Palo Alto Networks report substantial improvements in development velocity and security effectiveness through the deployment of Claude models, demonstrating their practical impact in real-world scenarios.

– **Getting Started:**
– Users are guided to access the new Claude models in Vertex AI Model Garden, with support materials available to assist in building and deploying AI applications.

This text serves as an important note for security and compliance professionals, highlighting advancements in AI model capabilities while ensuring that enterprise-grade security measures are placed at the forefront of AI developments.