Tag: llm
-
Cloud Blog: Scaling high-performance inference cost-effectively
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gke-inference-gateway-and-quickstart-are-ga/ Source: Cloud Blog Title: Scaling high-performance inference cost-effectively Feedly Summary: At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache.Β Our inference solution is based on AI Hypercomputer, a system built on our experience running models like…
-
Cloud Blog: Our approach to carbon-aware data centers: Central data center fleet management
Source URL: https://cloud.google.com/blog/topics/sustainability/googles-approach-to-carbon-aware-data-center/ Source: Cloud Blog Title: Our approach to carbon-aware data centers: Central data center fleet management Feedly Summary: Data centers are the engines of the cloud, processing and storing the information that powers our daily lives. As digital services grow, so do our data centers and we are working to responsibly manage them.…