machine learning workloads – Experimental News Clipping Site

Cloud Blog: GKE network interface at 10: From core connectivity to the AI backbone

Sep 17, 2025

—

by

Source URL: https://cloud.google.com/blog/products/networking/gke-network-interface-from-kubenet-to-ebpfcilium-to-dranet/ Source: Cloud Blog Title: GKE network interface at 10: From core connectivity to the AI backbone Feedly Summary: It’s hard to believe it’s been over 10 years since Kubernetes first set sail, fundamentally changing how we build, deploy, and manage applications. Google Cloud was at the forefront of the Kubernetes revolution with…

Cloud Blog: From clicks to clusters: Expanding Confidential Computing with Intel TDX

Aug 29, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/identity-security/from-clicks-to-clusters-confidential-computing-expands-with-intel-tdx/ Source: Cloud Blog Title: From clicks to clusters: Expanding Confidential Computing with Intel TDX Feedly Summary: Privacy-protecting Confidential Computing has come a long way since we introduced Confidential Virtual Machines (VMs) five years ago. The technology, which can protect data while in use, strengthens a security gap beyond data encryption at rest…

Cloud Blog: Taming the stragglers: Maximize AI training performance with automated straggler detection

Aug 11, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/stragglers-in-ai-a-guide-to-automated-straggler-detection/ Source: Cloud Blog Title: Taming the stragglers: Maximize AI training performance with automated straggler detection Feedly Summary: Stragglers are an industry-wide issue for developers working with large-scale machine learning workloads. The larger and more powerful these systems become, the more their performance is hostage to the subtle misbehavior of a single component.…

Cloud Blog: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6

Jul 31, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/c4-vms-based-on-intel-6th-gen-xeon-granite-rapids-now-ga/ Source: Cloud Blog Title: Now GA: C4 VMs with Local SSD, bare metal, and larger shapes, on Intel Xeon 6 Feedly Summary: We’re thrilled to announce a significant expansion of our C4 virtual machine series, with the general availability of 28 powerful new shapes. This expansion introduces C4 shapes with Google’s next-gen…

Cloud Blog: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs

Jul 28, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/compute/dynamic-workload-scheduler-calendar-mode-reserves-gpus-and-tpus/ Source: Cloud Blog Title: Understanding Calendar mode for Dynamic Workload Scheduler: Reserve ML GPUs and TPUs Feedly Summary: Organizations need ML compute resources that can accommodate bursty peaks and periodic troughs. That means the consumption models for AI infrastructure need to evolve to be more cost-efficient, provide term flexibility, and support rapid…

AWS News Blog: New Amazon EC2 P6e-GB200 UltraServers accelerated by NVIDIA Grace Blackwell GPUs for the highest AI performance

Jul 9, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://aws.amazon.com/blogs/aws/new-amazon-ec2-p6e-gb200-ultraservers-powered-by-nvidia-grace-blackwell-gpus-for-the-highest-ai-performance/ Source: AWS News Blog Title: New Amazon EC2 P6e-GB200 UltraServers accelerated by NVIDIA Grace Blackwell GPUs for the highest AI performance Feedly Summary: Amazon announces the general availability of EC2 P6e-GB200 UltraServers, powered by NVIDIA Grace Blackwell GB200 superchips that enable up to 72 GPUs with 360 petaflops of computing power for…

The Register: Google offers 7th-gen Ironwood TPUs for AI, with AI-inspired comparisons

Apr 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://www.theregister.com/2025/04/10/googles_7thgen_ironwood_tpus_debut/ Source: The Register Title: Google offers 7th-gen Ironwood TPUs for AI, with AI-inspired comparisons Feedly Summary: Sure, we’re doing FP8 versus a supercomputer’s FP64. What of it? Cloud Next Google’s seventh-generation Tensor Processing Units (TPU), announced Wednesday, will soon be available to cloud customers to rent in pods of 256 or 9,216…

Cloud Blog: Anyscale powers AI compute for any workload using Google Compute Engine

Mar 25, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/anyscale-powers-ai-compute-for-any-workload-using-google-compute-engine/ Source: Cloud Blog Title: Anyscale powers AI compute for any workload using Google Compute Engine Feedly Summary: Over the past decade, AI has evolved at a breakneck pace, turning from a futuristic dream into a tool now accessible to everyone. One of the technologies that opened up this new era of AI…

Cloud Blog: Using RDMA over Converged Ethernet networking for AI on Google Cloud

Mar 19, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/networking/rdma-rocev2-for-ai-workloads-on-google-cloud/ Source: Cloud Blog Title: Using RDMA over Converged Ethernet networking for AI on Google Cloud Feedly Summary: All workloads are not the same. This is especially the case for AI, ML, and scientific workloads. In this blog we show how Google Cloud makes the RDMA over converged ethernet version 2 (RoCE v2)…

Cloud Blog: Accelerate AI/ML workloads using Cloud Storage hierarchical namespace

Mar 17, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/storage-data-transfer/cloud-storage-hierarchical-namespace-improves-aiml-checkpointing/ Source: Cloud Blog Title: Accelerate AI/ML workloads using Cloud Storage hierarchical namespace Feedly Summary: As AI and machine learning (ML) workloads continue to grow, the infrastructure supporting them must evolve to meet their unique demands. Here on the Google Cloud Storage team, we’re committed to providing AI/ML practitioners with tools to optimize…

Tag: machine learning workloads