Tag: accelerators
-
The Register: Microsoft CTO says he wants to swap most AMD and Nvidia GPUs for homemade chips
Source URL: https://www.theregister.com/2025/10/02/microsoft_maia_dc/ Source: The Register Title: Microsoft CTO says he wants to swap most AMD and Nvidia GPUs for homemade chips Feedly Summary: Pivot will hinge on success of next-gen Maia accelerator Microsoft buys a lot of GPUs from both Nvidia and AMD. But moving forward, Redmond’s leaders want to shift the majority of…
-
Cloud Blog: GPUs when you need them: Introducing Flex-start VMs
Source URL: https://cloud.google.com/blog/products/compute/introducing-flex-start-vms-for-the-compute-engine-instance-api/ Source: Cloud Blog Title: GPUs when you need them: Introducing Flex-start VMs Feedly Summary: Innovating with AI requires accelerators such as GPUs that can be hard to come by in times of extreme demand. To address this challenge, we offer Dynamic Workload Scheduler (DWS), a service that optimizes access to compute resources…
-
Cloud Blog: Accelerating cloud migrations to Google Cloud with Searce to drive profitable growth
Source URL: https://cloud.google.com/blog/products/containers-kubernetes/recent-migrations-to-google-cloud-by-searce/ Source: Cloud Blog Title: Accelerating cloud migrations to Google Cloud with Searce to drive profitable growth Feedly Summary: As companies transition past legacy infrastructure and set themselves up for growth in AI, multi-cloud, and platform engineering requirements, many are looking to Google Cloud for its reliability, performance, and cost benefits.To achieve successful…
-
Cloud Blog: GKE network interface at 10: From core connectivity to the AI backbone
Source URL: https://cloud.google.com/blog/products/networking/gke-network-interface-from-kubenet-to-ebpfcilium-to-dranet/ Source: Cloud Blog Title: GKE network interface at 10: From core connectivity to the AI backbone Feedly Summary: It’s hard to believe it’s been over 10 years since Kubernetes first set sail, fundamentally changing how we build, deploy, and manage applications. Google Cloud was at the forefront of the Kubernetes revolution with…
-
Cloud Blog: Scaling high-performance inference cost-effectively
Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gke-inference-gateway-and-quickstart-are-ga/ Source: Cloud Blog Title: Scaling high-performance inference cost-effectively Feedly Summary: At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like…
-
The Register: Alibaba looks to end reliance on Nvidia for AI inference
Source URL: https://www.theregister.com/2025/08/29/china_alibaba_ai_accelerator/ Source: The Register Title: Alibaba looks to end reliance on Nvidia for AI inference Feedly Summary: Chinese cloud provider reportedly joins the homegrown silicon party Alibaba has reportedly developed an AI accelerator amid growing pressure from Beijing to curb the nation’s reliance on Nvidia GPUs. … AI Summary and Description: Yes Summary: The…