Tag: inference gateway
-
Cloud Blog: Scaling high-performance inference cost-effectively
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gke-inference-gateway-and-quickstart-are-ga/ Source: Cloud Blog Title: Scaling high-performance inference cost-effectively Feedly Summary: At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like…
-
Cloud Blog: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Container Management
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/2025-gartner-magic-quadrant-for-container-management-leader/ Source: Cloud Blog Title: Google is a Leader in the 2025 Gartner® Magic Quadrant™ for Container Management Feedly Summary: We’re excited to share that Gartner has recognized Google as a Leader for the third year in a row in the 2025 Gartner® Magic Quadrant™ for Container Management, based on its Completeness of…