Anywhere Cache – Experimental News Clipping Site

Cloud Blog: 5 best practices for Managed Lustre on Google Kubernetes Engine

Sep 19, 2025

—

by

Source URL: https://cloud.google.com/blog/products/containers-kubernetes/gke-managed-lustre-csi-driver-for-aiml-and-hpc-workloads/ Source: Cloud Blog Title: 5 best practices for Managed Lustre on Google Kubernetes Engine Feedly Summary: Google Kubernetes Engine (GKE) is a powerful platform for orchestrating scalable AI and high-performance computing (HPC) workloads. But as clusters grow and jobs become more data-intensive, storage I/O can become a bottleneck. Your powerful GPUs and…

Cloud Blog: Scaling high-performance inference cost-effectively

Sep 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/ai-machine-learning/gke-inference-gateway-and-quickstart-are-ga/ Source: Cloud Blog Title: Scaling high-performance inference cost-effectively Feedly Summary: At Google Cloud Next 2025, we announced new inference capabilities with GKE Inference Gateway, including support for vLLM on TPUs, Ironwood TPUs, and Anywhere Cache. Our inference solution is based on AI Hypercomputer, a system built on our experience running models like…

Cloud Blog: Scalable AI starts with storage: Guide to model artifact strategies

Aug 14, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/topics/developers-practitioners/scalable-ai-starts-with-storage-guide-to-model-artifact-strategies/ Source: Cloud Blog Title: Scalable AI starts with storage: Guide to model artifact strategies Feedly Summary: Managing large model artifacts is a common bottleneck in MLOps. Baking models into container images leads to slow, monolithic deployments, and downloading them at startup introduces significant delays. This guide explores a better way: decoupling your…

Cloud Blog: Selecting the right Hyperdisk block storage for your workloads

Jun 11, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/storage-data-transfer/how-to-choose-the-right-hyperdisk-block-storage-for-your-use-case/ Source: Cloud Blog Title: Selecting the right Hyperdisk block storage for your workloads Feedly Summary: As you adopt Google Cloud or migrate to the latest Compute Engine VMs or to Google Kubernetes Engine (GKE), selecting the right block storage for your workload is crucial. Hyperdisk, Google Cloud’s workload-optimized block storage that’s designed…

Cloud Blog: High performance storage innovations for your AI workloads

Apr 10, 2025

—

by

Kurt Seifried

in Uncategorized

Source URL: https://cloud.google.com/blog/products/storage-data-transfer/high-performance-storage-innovations-for-ai-hpc/ Source: Cloud Blog Title: High performance storage innovations for your AI workloads Feedly Summary: The high-performance storage stack in AI Hypercomputer incorporates learnings from geographic regions, zones, and GPU/TPU architectures, to create an agile, economical, integrated storage architecture. Recently, we’ve made several innovations to improve accelerator utilization with high-performance storage, helping you…

Tag: Anywhere Cache

Cloud Blog: 5 best practices for Managed Lustre on Google Kubernetes Engine

Cloud Blog: Scaling high-performance inference cost-effectively

Cloud Blog: Scalable AI starts with storage: Guide to model artifact strategies

Cloud Blog: Selecting the right Hyperdisk block storage for your workloads

Cloud Blog: High performance storage innovations for your AI workloads